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SECTION  I 


BACKGROUND 

The  random  word  generator  is  a PL/I  program  designed  to  run  on 
Honeywell's  Multiplexed  Information  and  Computer  System  (Multics),  a 
large  timesharing  system.  The  purpose  of  the  program  is  to  generate 
passwords  that  serve  to  authenticate  the  identities  of  users  of  Mul- 
tics . 


Users  of  the  standard  Multics  system  authenticate  themselves  at 
each  login  (start  of  a terminal  session)  by  typing  in  a password  known 
only  to  the  user  and  the  system.  Usually  the  user  selects  a password 
for  himself  at  initial  login  and  can  change  this  password  at  any  sub- 
sequent login.  The  ability  to  change  a password  is  useful  when  a user 
suspects  that  someone  else  may  have  guessed  his  password. 

A password  is  the  key  to  user  identification  and  protection. 

From  previous  experience,  however,  it  has  become  apparent  that  user- 
selected  passwords  are  frequently  fairly  easy  to  guess.  For  example, 
passwords  are  often  the  user's  own  first  name,  a name  of  a family  mem- 
ber, or  his  telephone  number,  [1]  Some  Multics  installations,  such  as 
the  Air  Force  Data  Services  Center  (AFDSC),  are  used  to  process  clas- 
sified information,  and  installations  like  the  AFDSC  cannot  take  a 
chance  that  one  of  their  user's  passwords  will  be  guessed.  The  solu- 
tion to  the  problem  at  the  AFDSC  was  a decision  to  assign  passwords  to 
users,  rather  than  to  allow  users  to  pick  their  own.  [2] 

The  administrative  overhead  of  assigning  a random  password  manu- 
ally whenever  a user  changes  his  password  is  generally  too  much  of  a 
burden  — especially  when  one  considers  that  only  a select  few  indi- 
viduals may  have  access  to  other  users'  passwords.  Instead,  the  pos- 
sibility of  providing  computer  generated  passwords  that  are  printed 
out  at  the  user's  console  on  each  password  change  was  investigated.  A 
user's  request  to  change  his  password  would  then  become  a call  to  a 
system  password  generator  program.  This  password  generator  has  been 
given  the  more  general  and  descriptive  name  of  "random  word  generator" 
in  this  report. 

Section  II  of  this  paper  discusses  the  goals  and  methods  used  by 
the  random  word  generator.  Section  III  contains  implementation  de- 
tails and  discusses  the  word  generating  algorithm.  Section  IV  con- 
tains an  analysis  of  the  algorithm  and  presents  certain  statistics.  A 
sample  of  random  words  generated  can  be  found  in  Appendix  IV. 
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Most  of  this  report  describes  the  random  word  generator  that  is 
being  made  available  for  users  on  Multics.  In  order  to  satisfy  a more 
stringent  criterion  of  "randomness"  a modified  version  of  the  random 
word  generator  has  also  been  prepared  that  generates  random  words  that 
are  all  equally  probable.  This  modified  version  is  discussed  at  the 
end  of  Section  IV. 
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SECTION  II 


METHODOLOGY 


REQUIREMENTS  AND  GOALS 

The  need  for  the  Multics  random  word  generator  program  can  be 
satisfied  by  fulfilling  the  following  two  requirements: 

generate  easily  remembered  words,  and 
make  the  words  difficult  to  guess. 

The  first  requirement  is  very  important  because  users  might  be  in- 
clined to  write  down  passwords  that  are  difficult  to  remember,  thereby 
increasing  the  chances  of  a password  compromise.  Also,  if  too  many 
users  forget  their  passwords,  the  administrative  overhead  of  getting 
the  users  logged  in  again  could  be  greater  than  that  of  distributing 
easier-to-remember  passwords  manually.  The  need  for  the  second  re- 
quirement is  apparent. 

Both  of  these  requirements  are  of  course  far  too  subjective  for 
direct  implementation  as  a computer  program.  It  is  necessary  to  re- 
state these  requirements  in  a much  more  concrete  manner  so  that  mean- 
ingful algorithms  can  be  designed. 

The  requirement  of  "making  the  words  difficult  to  guess"  is  most 
easily  satisfied  by  giving  the  program  the  ability  to  generate  a very 
large  set  of  possible  words,  and  the  ability  to  generate  these  words 
in  a random  manner.  Both  these  capabilities  can  easily  be  achieved, 
and  thus  we  will  be  more  concerned  at  this  point  with  the  first  re- 
quirement . 

Consider  that  the  random  word  generator  either  needs  a large  data 
base  of  words  to  choose  from  --  an  impractical  approach  that  has  been 
discarded  — or  has  to  form  words  out  of  sequences  of  letters  it  cre- 
ates through  some  algorithm.  The  requirement  of  rememberabili ty  can 
then  be  fulfilled  if  these  sequences  of  letters  are  of  one  or  more  of 
the  following  types: 

1.  Sequences  of  letters  that  can  be  easily  visualized,  such  as 

"aabbaa"  or  "xyxyxy" . 

2.  Sequences  of  letters  that  form  real  English  words. 
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3.  Sequences  of  letters  that  form  pronounceable  "words",  but  are  not 

necessarily  real  words. 

Of  these  three  choices,  methods  1 and  2 suffer  from  the  difficul- 
ty of  specifying  a practical  algorithm  for  such  sequences.  Alterna- 
tively, method  1 could  be  implemented  using  rules  that  yield  an  arbi- 
trarily defined  subset  of  all  possible  easy-to-visualize  sequences, 
but  this  subset  is  likely  to  be  small  for  a reasonable  number  of 
rules.  There  is  no  alternative  for  method  2 other  than  storing  a vast 
data  base  of  real  words. 

The  third  method  — that  of  using  pronounceable  sequences  — is 
the  selected  approach.  The  data  base  required  for  this  method  is  rel- 
atively small,  the  rules  can  be  fairly  well-defined,  and  the  set  of 
words  that  can  be  generated  is  quite  large.  Realizing  that  the  more 
"English"  a word  looks  the  easier  it  usually  is  to  remember,  an  at- 
tempt was  made  to  restrict  the  set  of  words  generated  to  those  which 
obeyed  some  kinds  of  rules  of  English  pronunciation.  This  attempt  was 
restructured  as  an  attempt  to  make  it  theoretically  possible  for  the 
word  generator  to  form  most  English  words. 

Because  of  this  goal  of  making  the  generated  words  look  like  Eng- 
lish, the  word  "pronounceable"  in  this  paper  refers  not  just  to  struc- 
tures that  can  be  phonetically  vocalized,  but  to  a set  of  more  re- 
strictive and  English-looking  structures.  For  example,  "tsip"  is  eas- 
ily pronounceable,  but  is  "un-English"  because  of  the  "ts"  at  the  be- 
ginning of  the  word.  A different  type  of  example  is  the  "gh"  combina- 
tion. The  word  "cough"  is  pronounceable  because  "gh"  in  this  context 
can  be  pronounced  like  "f" , but  "ghrom"  is  not  pronounceable  as  "from" 
because  "gh"  never  sounds  like  "f"  at  the  beginning  of  a word. 


PRONOUNCEABLE? 

One  may  wonder  how  a goal  of  "pronounceability"  can  be  attained 
with  well-defined  rules,  considering  how  undefined  and  exception-laden 
the  rules  of  English  pronunciation  are.  The  answer  is  simple:  the 
program  does  not  care  how  a given  word  is  to  be  pronounced  — it  only 
needs  to  make  sure  that  the  word  can  be  pronounced.  For  example, 
"tophat"  could  be  pronounced  "top-hat"  or  "to-fat",  depending  on  the 
reader's  preference.  On  the  other  hand,  "tophsat"  is  only  pronounce- 
able as  "tof-sat",  not  "top-hsat" . Also,  the  vowel  "o"  in  this  word 
might  be  pronounced  in  one  of  several  ways. 

Everyone  knows  that  a given  letter  or  sequence  of  letters  could 
be  pronounced  differently  in  different  contexts,  but  the  program  is 
usually  not  required  to  distinguish  between  the  different  contexts  or 
pronunciations.  Unlike  the  "rules  of  pronunciation",  the  "rules  of 
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pronounceabili ty"  can  be  made  fairly  precise.^ 

However  precise,  the  rules  and  method  used  to  generate  pronounce- 
able words  have  been  arrived  at  in  a "refined"  ad  hoc  manner  and  based 
on  the  author's  intuition.  The  method  is  not  described  in  any  pub- 
lished source.  Hence,  the  words  generated  may  be  considered  pro- 
nounceable only  by  the  author.  Others  may  find  some  of  these  words 
very  difficult  to  pronounce,  as  when  trying  to  pronounce  a foreign 
word  with  a strange  combination  of  letters.  Because  of  this  possible 
bias,  the  program  was  designed  to  incorporate  as  few  global  rules  as 
possible  within  its  text.  An  external  data  base,  a table,  is  used  to 
contain  most  of  the  subjectively  determined  rules.  These  external 
rules  consist  of  "yes"  or  "no"  answers  to  various  questions  asked  by 
the  program.  The  answers  to  the  questions  can  easily  be  modified  to 
suit  the  user's  preference.  "New"  rules  — those  asking  new  kinds  of 
questions  --  cannot  be  added  without  modifying  the  program. 


THE  POOR  APPROACH 

Letters  are  poor  sources  on  which  to  base  rules  of  pronounceabil- 
ity.  Not  only  do  individual  letters  sound  different  in  different 
words,  but  pairs  or  triplets  of  letters  often  form  single  sounds  that 
may  be  unlike  any  of  the  component  letters.  Determining  whether  a 
letter  is  pronounceable  or  "legal"  in  a given  word  often  involves 
knowing  how  the  letter  is  to  be  pronounced,  which  is  in  turn  dependent 
on  such  things  as  its  position  in  the  word  or  syllable  and  adjacent 
letters.  The  large  number  of  details  that  have  to  be  checked  for  each 
letter  makes  determination  of  pronounceability  very  complex. 

THE  IDEAL  APPROACH 

The  ideal  approach  that  will  always  generate  good  pronounceable 
words  would  be  to  relieve  the  program  of  any  notion  of  letters  and  use 


^Phonological  theory  is  a well  developed  science  that  in  part  at- 
tempts to  describe  the  phonetic  structure  of  English  (and  other  lan- 
guages) in  a complete  and  consistent  manner.  The  totality  of  rules 
and  theorems  used  in  such  a description  form  far  too  complex  a system 
for  the  scope  of  the  application  discussed  in  this  report.  Creation 
of  a smaller  subset  of  this  system  --  one  that  might  be  small  enough 
to  implement  and  would  still  give  reasonable  results  — appeared  to  be 
too  vast  an  undertaking.  Thus,  standard  phonological  theory  was  not 
considered  in  this  work. 
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p 

"phonemes"  instead,  A phoneme  is  an  "element"  of  pronunciation  — 
a unit  of  sound  that  cannot  be  usefully  broken  down  into  smaller 
sounds.  For  example,  the  pair  "sh"  as  pronounced  in  English  can  al- 
ways be  represented  as  a single  phoneme;  the  vowel  "a"  can  be  repre- 
sented as  one  of  several  phonemes  depending  on  its  context.  If  the 
rules  could  be  defined,  it  should  be  possible  to  put  together  random 
phonemes  to  form  a pronounceable  phoneme-word. 

Unfortunately,  though  this  method  yields  good  pronounceable  se- 
quences, the  translation  from  phonemes  to  letters  is  very  difficult 
and  very  un-algorithmic . An  example  of  this  difficulty  is  the  phoneme 
representing  the  sound  of  "k".  This  phoneme  can  be  translated  into 
" c " , "k",  or  "ck" . Which  one  should  be  used?  At  the  end  of  a word, 
usually  any  one  of  these  will  work,  and  another  randomization  factor 
has  to  be  included  to  make  the  choice.  At  the  beginning  of  a word, 

"k"  is  always  legal,  "ck"  is  never  legal,  and  "c"  is  legal  only  if  the 
following  letter  is  not  "e" , "i",  or  "y"  (in  which  case  "c"  would  have 
been  pronounced  like  "s").  Then,  to  determine  whether  "c"  is  a candi- 
date the  following  phoneme  must  first  be  translated  into  letters, 
which  may  in  turn  depend  on  other  adjacent  phonemes.  One  can  fix  the 
translation  so  that  the  "k"  sound  is  always  translated  to  the  letter 
"k",  but  then  the  goal  of  being  able  to  generate  most  English  words 
would  be  far  from  satisfied  (not  to  mention  that  the  letter  "c"  would 
never  be  used) . 


THE  COMPROMISE  APPROACH 

One  may  notice  that  with  the  phoneme  method  the  program  "knows" 
how  the  generated  word  is  pronounced  — specifically  not  a requirement 
as  mentioned  earlier.  A compromise  approach  was  chosen  that  uses  sim- 
ple "units",  instead  of  phonemes,  that  consist  of  a single  letter  or  a 
two-letter  pair.  A given  unit  is  considered  by  the  program  as  having 
only  one  "sound"  in  all  its  usages,  although  in  reality  that  unit  may 
be  pronounced  in  many  different  ways. 

Rules  can  be  determined  for  each  unit,  without  regard  to  how 
that  unit  is  pronounced  in  context,  by  merely  stating  a rule  that  in- 
cludes all  usages  of  that  unit.  This  composite  rule  is  usually  sim- 
pler than  all  of  the  individual  rules  for  the  different  pronunciations 
of  that  unit.  For  example,  the  letter  "g"  can  be  treated  as  a unit, 
and  the  rule  for  this  unit  at  the  beginning  of  a word  says  "this  unit 
may  only  be  followed  by  a,  e,  i,  1,  o,  r,  u or  y."  In  some  of  these 
cases  "g"  is  pronounced  soft  and  in  others  hard  — in  fact  there  is  no 
simple  rule  for  how  "g"  is  pronounced  (e.g.,  "gigantic"  and  "giggle"), 
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also  called 


"phonetic  segments"  in  phonology. 
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but  the  program  doesn't  need  to  make  any  distinctions. 

The  3^  units  presently  used  are  listed  below.  These  units  are 
stored  in  a table  and  are  input  to  the  random  word  generator.  A lar- 
ger or  smaller  set  can  be  defined  if  experience  indicates  these  to  be 
unsatisf  actory . 


a 

f 

k 

P v 

ch 

th 

b 

g 

1 

r w 

gh 

wh 

c 

h 

m 

s X 

ph 

qu 

d 

i 

n 

t y 

rh 

ck 

e 

J 

0 

u z 

sh 

Note 

that  the  letter  "q"  is  the 

only  letter  not  appearing  as  a 

one-letter 

unit 

because 

English  usage 

makes 

it  more  convenient  to 

treat  "qu" 

as  a 

unit . 

Many  two-letter  vowel 

combinations,  such  as 

"ea",  "ie" 

, "ai" 

, etc., 

that  should  be  considered  separate  units  are 

not  included  because  little  loss  of  generality  occurs  (i.e.,  the  set 
of  words  that  can  be  generated  is  nearly  the  same  whether  these  vowels 
are  separate  units  or  not).  Also,  double  letter  pairs  like  "11" , "rr" 
and  f,tt”  need  not  be  included  for  similar  reasons.  On  the  other  hand, 
the  pair  ffshw  is  needed  because  words  such  as  "shrink"  and  "wash"  are 
not  pronounceable  when  "s"  and  "h"  are  treated  as  separate  units. 


SYLLABLES 

Besides  the  rules  used  by  the  program,  there  is  a primary  assump- 
tion that  governs  the  formation  of  words:  if  pronounceable  syllables 

are  concatenated  (subject  to  some  minor  restrictions),  they  will  form 
a pronounceable  word.  Thus,  the  task  of  the  random  word  generator  is 
to  form  pronounceable  syllables. 

This  task  requires  precise  definition  of  "syllable";  thus  the 
following  definition  is  made  at  this  point:  a syllable  is  an  arbitrary 
series  of  units  that  contains  exactly  one  or  two  consecutive  vowel 
units.  Vowel  units  are  "a",  "e",  "ilf,  "o"  , "u"f  and  "y".  For  exam- 
ple, the  following  are  legal  syllables  (where  "v"  represents  a vowel 
unit  and  "c"  represents  a non-vowel  unit,  or  consonant  unit): 

ccv  cvvccc  cv  v vv  vc 
and  the  following  are  illegal  syllables: 
cc  (no  vowels) 

vvv  (more  than  2 consecutive  vowels) 

vccvc  (all  vowels  not  consecutive) 

vcv  (all  vowels  not  consecutive) 
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Note  that  each  of  the  last  two  examples  can  possibly  be  split  into  two 
syllables,  such  as  "vc-cvc"  and  "v-cv". 

The  above  definition  of  Syllable”  seems  to  work  in  English  ex- 
cept for  one  common  case:  the  silent  "e"  at  the  end  of  words  or  some- 
times syllables  often  forms  a syllable  containing  two  non-contiguous 
vowels. Of  course,  that  is  because  English  usually  only  requires  a 
vowel  sound  in  a syllable,  and  in  the  case  of  silent  "e"  the  "eM 
should  not  be  considered  a vowel.  The  program,  however,  has  no  way  of 
telling  whether  the  "e"  is  silent.  To  make  matters  worse,  there  are 
words,  such  as  "subtle”,  "bugle",  "little"  that  do.  have  a final  sylla- 
ble whose  only  vowel  is  the  final  (silent)  "e".  These  are  common 
enough  cases  in  English  to  warrant  special  consideration  in  the  word 
generator . 


JUXTAPOSITION 

The  random  word  generator  forms  syllables  from  left  to  right,  by 
combining  random  units  one  at  a time.  For  each  new  unit  the  program 
determines  whether  that  unit  can  legally  be  appended  to  the  units  al- 
ready in  the  syllable.  If  it  cannot,  the  unit  is  discarded  and  an- 
other random  unit  is  tried. 

In  English  the  legality  of  a unit  is  usually  determined  by  check- 
ing immediately  adjacent  units.  Units  separated  from  each  other  fre- 
quently affect  each  other's  pronunciation  but  only  occasionally  deter- 
mine whether  the  construction  is  legal  or  not.  The  random  word  gener- 
ator uses  rules  of  juxtaposition  as  the  bases  for  creating  pronounce- 
able syllables. 

Each  time  the  program  gets  a new  random  unit,  it  forms  a pair 
consisting  of  that  unit  and  the  previous  unit.  This  unit-pair  is 
looked  up  in  a table  and  bits  of  information  are  extracted  that  speci- 
fy what  can  be  done  with  that  pair.  For  example,  the  unit-pair  "rt" 
will  have  bits  specifying  that  the  pair  may  not  begin  a syllable  and 
that  a vowel  must  precede  this  pair  if  it  is  entirely  contained  within 
a syllable.  The  table  may  sometimes  specify  that  a unit-pair  must  al- 
ways be  split  between  two  syllables  (for  example,  the  pair  "kp"), 
which  is  one  way  in  which  a new  syllable  can  be  started.  Some  pairs, 
such  as  "hh" , can  never  appear  together,  even  if  split  between  sylla- 
bles. The  different  types  of  rules  that  can  be  specified  in  the  table 
are  discussed  in  the  next  section. 


^The  vowel  pair  "ue"  in  "baroque"  and  "catalogue"  is  another  excep- 
tion, though  much  less  common. 
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MISCELLANEOUS  ASSUMPTIONS 


Several  more  assumptions  have  to  be  made  before  syllables  can  be 
generated  properly.  Again,  these  assumptions  were  arrived  at  intui- 
tively and  no  claim  is  made  for  their  completeness.  The  assumptions 
discussed  below  are  those  that  have  been  incorporated  into  the  program 
structure,  as  opposed  to  those  that  are  specified  in  external  tables. 
They  are  presented  in  order  of  importance. 

Consecutive  Vowels 


A rule,  in  part  already  stated,  involving  consecutive  vowels, 
says  that  a maximum  of  two  consecutive  vowel  units  is  permitted.  This 
rule  pertains  to  all  consecutive  vowel  units  even  across  syllables. 

The  reason  for  this  extension  across  syllables  is  that  sequences  such 
as  "aiea"  look  "funny"  and  are  sometimes  difficult  to  pronounce,  even 
though  there  can  be  a syllable  split  in  the  middle.  The  English  lan- 
guage itself  "admits"  of  this  difficulty  between  consecutive  words  by 
trying  to  correct  it  in  two  common  cases:  the  use  of  "an"  instead  of 
"a",  and  the  alternate  pronunciation  of  "the",  when  the  following  word 
begins  with  a vowel  sound.  There  are  few  English  examples  of  more 
than  two  consecutive  vowels  (the  "eau"  combination  is  one  of  them). 
Note  that  the  word  "queen"  is  legal  according  to  random  word  generator 
rules  because  "qu"  is  considered  to  be  a consonant  unit. 

A difficulty  with  this  assumption  involves  the  unit  "y" . For 
purposes  of  syllabification,  "y"  must  be  treated  as  a vowel  (i.e.,  a 
syllable  can  contain  the  single  vowel  "y"),  but  for  the  above  assump- 
tion "y"  should  not  be  treated  as  a vowel.  Three-vowel  sequences  in- 
volving "y"  are  very  common:  "eye"  and  "you"  being  two  examples.  Thus 
the  requirement  of  at  most  two  consecutive  vowel  units  must  be  waived 
if  one  of  the  vowels  is  "y". 

The  Vowel  "y» 

In  order  to  solve  the  consecutive  vowel  problem  above  it  sufficed 
to  treat  "y"  always  as  a consonant.  However,  it  should  also  be  legal 
for  "y"  to  be  the  only  vowel  in  a syllable.  Therefore,  for  the  pur- 
poses of  syllabification  only,  the  random  word  generator  treats  "y"  as 
a vowel  only  if  the  "y"  is  not  immediately  preceded  by  a vowel  within 
that  syllable.  The  sequence  "vowel-y-vowel"  would  thus  have  to  be 
split  between  two  syllables,  but  "y-vowel-vowel"  would  not.  The  addi- 
tional rule  about  silent  "e"  below  allows  a "vowel-y-e"  sequence  to 
end  a word. 
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The  Silent  »e" 


The  special  case  of  final  "e"  has  previously  been  mentioned.  The 
final  "e"  in  a word  in  English  is  almost  never  pronounced  and  there- 
fore cannot  be  used  as  the  only  vowel  in  the  last  syllable.  There  is 
no  problem  taking  care  of  such  exceptions  in  a uniform  way.  However, 
there  is  a very  large  set  of  exceptions  to  this  final  "e"  rule:  words 
such  as  "meddle",  "nestle",  "double"  — all  ending  in  "le"  — are  le- 
gal words  in  English,  yet  no  vowel  is  pronounced  in  the  last  syllable. 
The  rules  used  by  the  word  generator  do  not  allow  final  syllables  of 
"ble"  and  "tie"  and  therefore  such  words  cannot  be  generated.  This 
class  of  words  appears  to  be  the  largest  that  cannot  be  handled  by  the 
word  generator.  In  order  to  solve  this  deficiency  it  would  be  neces- 
sary to  first  include  "le"  as  a unit  in  the  table,  and  then  make  spe- 
cial kinds  of  tests  to  determine  whether  this  unit  is  legal  in  a given 
context.  It  is  not  possible,  without  creating  new  rules  specific  to 
this  "le"  unit,  to  specify  the  necessary  restrictions.  Creating  new 
rules  for  this  case  was  considered  feasible,  but  appeared  to  be  too 
awkward  and  so  was  left  out. 

The  Initial  "v" 

The  unit  "y"  may  not  be  the  only  vowel  in  the  first  syllable  of  a 
word  if  the  word  begins  with  "y".  Only  strange  words  like  "yclept" 
violate  this  rule.  This  is  a minor  point  but  must  be  taken  care  of 
explicitly.  Otherwise,  many  strange  words  are  generated. 

Three  Identical  Units 

There  is  nothing  in  the  rules  so  far  stated  that  prohibits  three 
or  more  identical  consecutive  consonants.  This  condition  may  possibly 
be  legal,  provided  that  no  more  than  two  consecutive  consonants  occur 
in  the  same  syllable.  Instead  of  trying  to  force  a syllable  split  be- 
tween such  groups,  the  decision  was  made  to  merely  limit  the  number  of 
consecutive  identical  units  to  two.  Note  that  this  restriction  is  not 
a pronounceability  problem,  but  a case  of  an  un-English-looking  con- 
struction. 


SUMMARY 

The  goal  of  the  random  word  generator  is  to  generate  easily  re- 
membered words  that  are  difficult  enough  to  guess  to  be  suitable  for 
passwords.  This  goal  has  been  translated  into  requirements  of  pro- 
nounceability and  randomness.  An  attempt  was  made  to  include  almost 
all  English  words  in  the  set  of  words  that  can  be  generated,  and  to 
exclude  constructions  that  are  never  found  in  English  words. 
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The  random  word  generator  works  by  forming  pronounceable  sylla- 
bles and  concatenating  them  to  form  a word.  Rules  of  pronounceability 
are  stored  in  a table  for  every  unit  and  every  pair  of  units.  The 
rules  are  used  to  determine  whether  a given  unit  is  illegal  or  legal, 
based  on  its  position  within  the  syllable  and  adjacent  units.  Most 
rules  and  checks  are  syllable  oriented  and  do  not  depend  on  anything 
outside  the  current  syllable.  In  a few  cases  checks  do  extend  outside 
the  current  syllable.  These  case  are: 

1.  Three  identical  consecutive  units 

2.  Three  consecutive  vowel  units 

3.  Silent  "e"  at  the  end  of  a word 

4.  "y"  beginning  a word 

5-  Certain  illegal  pairs  of  units 
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SECTION  III 


IMPLEMENTATION  DETAILS 

The  random  word  generator  is  organized  as  a main  procedure  that 
references  two  tables  and  an  external  procedure.  The  user  supplies 
the  two  tables:  a "unit"  table  that  defines  the  units  (such  as  those 
listed  on  page  7)  and  specifies  rules  about  each  unit,  and  a "digram" 
table  that  specifies  rules  about  all  possible  pairs  of  such  units. 

The  random_unit  subroutine,  which  returns  a random  unit  when  called  by 
random_word_,  must  also  be  provided  by  the  user.  The  method  used  by 
this  subroutine  to  generate  the  random  units  may  be  any  method  desired 
and  based  on  any  distribution.  Such  a distribution  might,  for  exam- 
ple, be  based  on  the  frequency  of  use  of  the  individual  units  in 
English. 


SPECIFICATION  OF  RULES 

As  mentioned  in  Section  II,  the  random  word  generator  uses  two 
types  of  rules:  those  that  are  fixed  and  embodied  in  the  program 
structure  and  those  that  are  variable  and  embodied  in  external  tables. 
The  fixed  rules  are  general  in  that  they  are  not  specific  to  any  one 
letter  or  unit.  The  tables  specify  rules  pertaining  to  individual 
units  or  the  juxtaposition  of  units.  The  tables  will  be  discussed 
first,  followed  by  specification  of  the  internal  rules. 

The  Digram  Table 

This  table  contains  one  entry  for  every  possible  pair  of  units 
(digram),  whether  that  pair  is  allowed  or  not.  Thus,  with  3^  differ- 
ent units,  there  would  be  1156  entries.  The  entry  for  each  pair  con- 
sists of  eight  bits  of  information  that  together  form  the  "rules"  for 
that  particular  digram.  Each  bit  is  a yes  or  no  answer  to  a specific 
question  asked  by  the  program.  The  name  of  each  of  these  bits  and  the 
questions  answered  are  as  follows: 

1 . must_begin  Must  this  pair  begin  a syllable? 

2.  not_begin  Is  this  pair  prohibited  from  beginning  a syllable? 

3.  break  Is  this  pair  illegal  within  a syllable  (i.e.  must  it 

be  split  between  two  syllables)? 
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4.  prefix 

5.  suffix 

6.  end 

7.  not— end 

8.  illegal_jpair 


Must  this  pair  be  preceded  by  a vowel  unit  if  it 
does  not  begin  a syllable? 

Must  this  pair  be  followed  by  a vowel  unit  if  it 
does  not  end  a syllable? 

Must  this  pair  end  a syllable? 

Is  this  pair  prohibited  from  ending  a syllable? 

Is  this  pair  illegal  (even  if  split  between 
syllables)? 


Obviously  all  eight  bits  are  inherently  non-independent.  There 
are  actually  far  fewer  combinations  of  these  eight  bits  that  can  be 
specified.  Out  of  these,  less  than  sixteen  combinations  are  ever  used 
in  practice  due  to  the  structure  of  the  English  language.  Thus,  four 
bits  yielding  16  combinations  would  be  enough.  The  actual  internal 
representation  of  these  bits  only  affects  speed  and  storage  space, 
however,  and  is  not  of  importance  in  this  discussion.  In  addition, 
some  other  application  of  the  random  word  generator  (perhaps  with  a 
different  language)  may  use  more  combinations.  Appendix  I contains 
the  digram  table  currently  in  use  for  the  34  units  defined  on  page  7. 

An  example  will  best  illustrate  the  usage  of  these  bits.  Consid- 
er the  digram  table  entry  for  the  pair  of  units  "f"  and  ”1”  as  shown 
in  Appendix  I.  The  bits  that  are  set  for  !,fl"  are: 

must_begin 
suffix 
not  end 


The  must_begin  bit  says  that  if  an  "fl"  is  encountered  in  a syllable, 
it  must  begin  that  syllable.  The  suffix  bit  says  that  the  unit  fol- 
lowing "fl"  must  be  a vowel  if  "fl”  is  not  the  last  pair  in  a sylla- 
ble. The  not_end  bit  says  that  "fl"  may  not  be  the  last  pair  in  a 
syllable.  The  specification  of  the  digram  "fl”,  thus,  restricts  its 
use  within  a syllable  as  the  first  pair  in  one  of  the  following  six 
contexts: 


fla...,  fie...,  fli...,  flo. . . , flu...,  fly... 

where  "..."  signifies  additional  units  within  the  syllable.  Of 
course,  if  there  are  any  further  restrictions  on  the  use  of  the  pairs 
"la",  "le",  etc.  that  prevent  them  from  appearing  after  the  "f",  these 
restrictions  must  be  taken  into  account.  Note  that  none  of  the  eight 
digram  bits  except  illegal_jpair  apply  when  the  pair  is  split  between 
two  syllables.  If  "fl"  is  split,  the  "1"  becomes  the  first  unit  of 
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the  next  syllable,  and  rules  for  pairs  beginning  with  "1"  must  be  ex- 
amined. A quick  glance  at  the  digram  table  shows  that  all  pairs  be- 
ginning with  "1"  have  the  not_begin  bit  set,  except  the  six  pairs: 

la,  le,  li,  lo,  lu,  and  ly, 
and  processing  can  continue  with  this  information. 

The  random  word  generator  makes  sure  that  at  all  times  the  rules 
specified  in  the  digram  table  are  satisfied  for  every  two  consecutive 
units  in  the  word  being  formed. 

The  Unit  Table 

In  addition  to  rules  for  unit  pairs,  there  is  a table  containing 
four  bits  of  information  pertaining  to  the  individual  units.  For  each 
unit,  the  four  bits  are  as  follows. 

1.  not_begin_syllable 

This  bit  indicates  that  this  unit  may  not  begin  a syllable. 
This  bit  is  redundant  in  that  the  digram  table  can  specify  that  all 
possible  pairs  beginning  with  this  unit  may  not  begin  a syllable. 
The  purpose  for  using  this  bit  is  for  efficiency  — when  generating 
the  first  unit  of  a new  syllable,  the  program  would  otherwise  have 
to  search  through  all  possible  digrams  beginning  with  this  unit  in 
order  to  determine  whether  this  unit  is  legal.  This  bit  is  cur- 
rently set  for  the  units  "x"  and  "ck".  A small  number  of  words  in 
English  do  begin  with  "x",  but  they  are  mostly  technical  or  scien- 
tific terms. 

2.  no_final_split 

This  bit  indicates  that  this  unit,  when  appearing  at  the  end 
of  a word,  must  not  be  the  only  vowel  in  the  final  syllable.  This 
bit  is  only  set  when  the  "vowel"  bit  is  set,  and  is  currently  set 
only  for  the  unit  "e". 

3.  vowel 

This  bit  is  set  for  vowel  units.  It  is  currently  set  for  the 
units:  a,  e,  i,  o,  u,  but  may  also  be  set  for  any  units  consisting 
of  vowel  pairs  or  that  are  to  be  treated  as  vowels  that  one  might 
add  to  the  table  at  some  future  time. 
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4.  alternate  vowel 


This  bit  indicates  that  this  unit  is  to  be  treated  as  either  a 
vowel  or  a consonant,  depending  on  context  as  discussed  in  Section 
II  of  this  report  on  page  9.  This  bit  is  set  only  for  the  unit 

"y". 


Admittedly  these  four  bits  are  highly  specialized  and  at  least 
bits  2 and  4 could  just  as  easily  be  incorporated  into  the  program 
logic  as  tests  for  specific  units.  However,  the  program  actually 
works  with  numbers  representing  units,  rather  than  the  units  them- 
selves, and  the  assignment  of  a particular  number  to  a particular  unit 
is  arbitrary.  By  using  a bit  in  the  unit  table  for  all  special  cases, 
all  references  to  specific  letters  or  units  are  removed  from  the  pro- 
gram. Refer  to  Appendix  I for  the  unit  table  currently  in  use. 

Random  Units 


As  stated  earlier,  the  random  word  generator  requires  the  user  to 
supply  the  subroutine  random_unit.  This  routine  is  called  by  the  word 
generator  each  time  a random  unit  is  needed.  The  random  units  are 
generated  based  on  some  predetermined  distribution.  Of  course,  not 
all  units  thus  generated  will  be  acceptable  to  the  word  generator  in 
every  position  of  the  word:  randoinjjnit  will  be  repeatedly  called  un- 
til an  acceptable  unit  is  returned.  The  actual  distribution  of  legal 
units  is  different  for  every  position  in  a particular  word,  which,  for 
any  unit,  depends  on  the  units  that  precede  it  and  the  digram  and  unit 
tables.  The  random_unit  subroutine  itself  makes  no  tests  for  legal 
units,  but  merely  uses  its  fixed  distribution  each  time  it  is  called. 

The  distribution  of  units  that  is  currently  in  use  along  with  the 
digram  and  unit  tables  discussed  earlier  is  shown  in  Appendix  VI  in 
the  description  of  the  random^ unit_  subroutine.  There  is  another  en- 
try point  in  random_unit  called  random_vowel , which  is  called  by  the 
word  generator  for  efficiency  in  cases  when  it  is  known  that  only  a 
vowel  unit  will  be  acceptable.  The  distribution  of  vowels  returned  by 
this  second  entry  is  also  shown. 

The  Algorithm 

The  digram  table,  the  unit  table,  and  the  random_unit  subroutine 
are  considered  user-supplied  in  that  they  may  be  modified  without  af- 
fecting the  word  generator  program  logic.  The  external  rules  were 
specified  in  the  two  tables.  The  algorithm  used  to  generate  random 
words  based  on  these  external  rules  defines  the  fixed  internal  rules. 
The  internal  rules  cannot  be  modified  without  changing  the  logic  of 
the  algorithm.  The  complete  algorithm  is  shown  in  Appendix  II,  writ- 
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ten  in  a PL/I-like  language,  and  a high  level  flowchart  is  shown  in 
figure  1.  Appendix  III  contains  the  source  program  listing  of 
random_word  which  implements  this  algorithm. 

The  function  of  the  main  body  of  the  algorithm  is  to  determine 
whether  a given  unit,  generated  by  random_unit,  can  be  appended  to  the 
end  of  the  partial  word  formed  so  far.  If  illegal,  the  unit  is  dis- 
carded and  random_unit  is  called  again.  Once  a unit  is  accepted,  var- 
ious state  variables  are  updated  and  a unit  for  the  next  position  in 
the  word  is  tried.  A unit  previously  generated  and  accepted  can  never 
be  discarded. 

The  flowchart  in  figure  1 shows  generally  how  a word  is  built  up. 
The  names  in  all  capitals  (INDEX,  SYLLABLE_LENGTH,  etc.)  are  referen- 
ces to  variables  initialized  within  the  flowchart.  Names  in  quotes 
(e.g. , "syllable_length" ) refer  to  the  bits  in  the  digram  table  or 
unit  table  for  the  last  pair  of  units  or  the  current  unit.  The  array 
UNIT  holds  the  units  of  the  word  as  they  are  generated,  where 
UNIT(INDEX)  is  the  current  unit. 

Beginning  at  the  top  of  the  flowchart,  the  first  unit  of  the  word 
is  selected  at  random  by  random_unit  and  inserted  into  UNIT(l).  If 
this  unit  is  legal,  according  to  rules  in  the  unit  table,  the  second 
unit  of  the  word  is  selected  and  loaded  into  UNIT(2).  This  time  the 
rules  must  be  satisfied  for  both  the  unit  table  entry  for  UNIT(2)  and 
the  digram  table  entry  for  the  pair  [UNIT( 1 ) , UNIT(2) ] . If  a given 
unit  is  not  acceptable,  another  is  tried  in  its  place.  When  the  end 
of  the  word  is  reached  (as  determined  by  the  number  of  letters  desired 
by  the  caller  of  random_word_) , additional  checks  are  made  before  the 
algorithm  can  terminate. 

If  the  digram  table  is  consistent,  there  should  always  be  some 
unit  that  will  be  legal  for  any  legal  state  of  the  algorithm.  How- 
ever, self-consistency  checks  on  the  digram  table  are  extremely  diffi- 
cult to  make.  Therefore,  an  arbitrary  limit  of  100  tries  is  placed  on 
generating  any  particular  unit.  If  100  calls  to  random_unit  fail  to 
yield  a legal  unit,  the  whole  word  is  discarded  and  the  program  starts 
over.  This  100  tries  limit  is  not  explicitly  shown  in  the  flowchart 
but  is  contained  in  the  program  text  (see  Appendix  III). 

Another  observation  concerning  the  100  limit  is  that,  because  the 
program  is  dealing  with  random  events,  it  is  theoretically  possible 
for  100  tries  to  fail  to  yield  a legal  unit  even  though  there  is  a 


^A  "state"  here  is  defined  by  the  values  of  the  state  variables  used 
in  the  algorithm  as  given  in  Appendix  II,  and  includes  the  units  al- 
ready accepted  as  part  of  the  word  being  formed. 
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Figure  1 . Random__Word_  Flowchart 


unit  that  is  legal.  Thus,  in  order  to  prevent  excessively  long  loops, 
it  is  useful  to  place  a limit  on  the  number  of  tries,  even  though  the 
digram  table  may  be  consistent. 


RESULTS 

Appendix  IV  contains  a printout  of  2000  random  words  of  five  to 
eight  letters.  The  length  of  five  to  eight  letters  was  chosen  for 
this  run  because  such  words  are  more  pronounceable  than  longer  words, 
and  fewer  than  five  letters  causes  too  many  duplicates  to  be  genera- 
ted, thus  making  the  words  unsuitable  for  use  as  passwords.  Actually 
the  random  word  generator  has  a capability  of  generating  words  of  any 
length.  The  words  in  the  printout  have  been  sorted  alphabetically 
merely  as  an  aid  to  checking  certain  constructs.  The  possible  rela- 
tionships between  successively  generated  words  depend  on  the  random 
number  generator  in  use,  which  is  outside  the  scope  of  this  discus- 
sion. 


Notice  in  the  printout  that  alongside  each  word  is  the  same  word 
divided  into  syllables  (hyphenated).  An  interesting  by-product  of  the 
algorithm  is  the  ability  to  determine  syllable  divisions  in  the  word 
generated.  In  certain  cases  the  syllable  split  can  not  be  precisely 
defined.  For  example,  the  word  "without"  can  be  split  as  "with-out" 
or  "wi-thout"  according  to  the  rules.  In  such  cases  the  random  word 
generator  makes  an  arbitrary  (but  predetermined)  choice  of  where  to 
split  the  syllables. 

It  may  also  be  that  certain  hyphenations  are  not  the  most  logical 
as  an  aid  to  pronunciation.  An  example  can  again  be  found  in 
"without",  which  would  be  hyphenated  as  "wit-hout"  if  the  "t"  and  "h" 
were  generated  as  individual  units  instead  of  as  a "th"  unit.  The  de- 
cisions about  hyphenation  made  by  the  program  are  built  into  the  algo- 
rithm and  are  based  on  what  the  author  considers  the  most  likely  to  be 
acceptable  in  the  general  case. 
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SECTION  IV 


ANALYSIS 

Of  the  two  requirements  of  the  random  word  generator  stated  on 
page  3>  the  requirement  of  making  the  words  "difficult  to  guess"  was 
stated  as  being  easy  to  achieve  by  giving  the  word  generator  the  capa- 
bility of  generating  a very  large  set  of  words.  The  more  difficult 
requirement  of  pronounceability  guided  the  design,  and  it  was  intui- 
tively assumed  that  a large  enough  set  of  words  to  satisfy  some  cri- 
terion of  randomness  would  automatically  result. 

In  this  section  an  attempt  is  made  to  present  some  quantitative 
measurements  and  statistics  that  may  allow  one  to  determine  whether 
the  word  generator  is  actually  "random  enough"  in  some  sense.  With  a 
tool  as  crucial  to  the  security  of  the  system  as  a password  generator, 
it  must  be  assured  that  the  passwords  really  are  "difficult  to  guess". 

There  is  no  one  quantity  to  be  calculated  that  will  provide  a 
meaningful  description  of  the  random  word  generator's  effectiveness  in 
all  its  possible  applications.  For  example,  in  an  application  where 
the  random  words  are  used  to  create  identifiers  of  individuals,  the 
total  number  of  possible  words  and  the  probability  of  duplication  are 
of  interest.  In  the  application  as  a Multics  password  generator,  du- 
plicates are  not  as  important  as  the  probability  that  a given  user's 
password  will  be  guessed  by  another  user.-*  For  other  applications 
the  probability  distribution  of  the  words  might  be  required. 

It  is  hoped  that  enough  statistical  data  is  provider  in  this  sec- 
tion so  that,  with  sufficient  further  analysis,  most  quantities  of  in- 
terest can  be  calculated.  A complete  statistical  analysis  is  beyond 
the  scope  of  this  discussion.  However,  attention  will  be  focused  on 
areas  of  interest  to  users  of  the  random  word  generator  as  a password 
generator  for  Multics. 


■'In  Multics,  it  is  of  little  value  to  know  a password  without  know- 
ing the  name  of  the  user  to  whom  it  belongs;  i.e.,  one  cannot  login  to 
the  system  merely  by  typing  a password  and  thereby  impersonating  who- 
ever that  password  happened  to  belong  to.  Other  systems  may  actually 
use  the  password  to  identify  rather  than  verify. 
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The  following  four  topics  have  been  chosen  for  consideration: 

total  number  of  different  pronounceable  words, 
probability  of  a given  word  being  generated, 
most  probable  word,  and 
distribution  of  word  probabilities. 

Some  quantitative  measure  of  each  of  the  above  has  been  obtained,  but 
through  empirical  analysis  of  the  random  word  generator's  output  rath- 
er than  through  an  analysis  of  the  algorithm.  Analysis  of  the  algo- 
rithm would  of  course  yield  the  most  precise  statistics,  but  the  com- 
plexity of  the  algorithm  and  its  states,  and  the  large  amount  of  data 
in  the  supporting  tables  (which  might  be  subject  to  change  by  anyone), 
make  such  an  analysis  extremely  difficult  and  somewhat  limited  in  ap- 
plicability. Instead,  minor  modifications  to  the  random  word  genera- 
tor and  some  additional  programs  were  incorporated  to  supply  the  data 
necessary  for  this  analysis.  If  a change  is  made  in  one  of  the 
supporting  tables,  new  data  can  be  obtained  merely  by  re-running  these 
additional  programs. 

It  should  be  noted  that  all  of  the  statistics  and  numerical  fig- 
ures presented  in  this  section  apply  only  when  the  tables  are  set  up 
as  in  Appendix  I.  The  methods  used  to  obtain  the  results,  however, 
apply  to  any  tables  the  user  may  supply. 


NUMBER  OF  WORDS 

The  number  of  possible  random  words,  though  extremely  difficult 
to  determine  by  analyzing  the  algorithm,  can  be  established  to  any  de- 
gree of  accuracy  in  a fairly  simple  manner. 

Consider  all  possible  words  of  a given  length  L that  can  be 
formed  from  the  26  letters  of  the  alphabet,  without  regard  to  pro- 
nounceability . If  N is  the  number  of  such  words,  then 

N = 26L.  (1) 

Out  of  these,  a certain  fraction  f are  "pronounceable"  according  to 
random  word  generator  rules.  The  value  of  f may,  of  course,  depend  on 
L.  If  we  can  determine  f,  we  can  calculate  the  number  of  pronounce- 
able words  n of  a given  length  simply  by 

n = fN.  (2) 

An  estimate  for  the  value  of  f can  be  obtained  by  picking  a ran- 
dom subset  of  size  m out  of  the  N words,  and  finding  out  what  fraction 
of  this  subset  is  pronounceable.  The  larger  the  value  of  m,  the 


26 


smaller  the  probable  error  we  will  have  in  our  estimated  value  of  f. 
Actually  the  accuracy  of  our  estimate  can  be  expressed  in  terms  of  a 
probability  that  its  absolute  error  is  less  than  a certain  amount. 

Generating  a random  subset  of  N words  of  length  L is  easy  with  a 
uniform  random  number  generator.  With  a small  modification^  the 
random  word  generator  can  be  given  a particular  word,  and  will  "run 
through  its  rules"  to  determine  whether  the  word  is  legal  (i.e.,  pro- 
nounceable). A sample  run  of  100,000  words  of  eight  letters  was  made. 
The  length  of  eight  letters  was  chosen  for  this  run  because  that  is 
the  maximum  length  of  a user's  password  acceptable  to  Multics.  There 
were  2653  acceptable  words  out  of  this  run  of  100,000,  yielding  an  es- 
timate for  f of  .02653.  For  eight  letters, 

N = 268  = 208,827,064,576  (3) 

and  the  estimated  value  for  n is 

.02653N  = 5.540  x 109.  (4) 

The  accuracy  of  the  estimate  for  f as  determined  above  can  be 
calculated  as  a confidence  interval  for  f.  This  confidence  interval 
is  written  approximately,  for  large  m,  in  the  form 


k/m  + z\ 

k/m( 1-k/m) 

■ \J 

m 

where  k is  the  number  of  acceptable  words  out  of  the  sample  of  m,  and 
z is  an  appropriate  percentage  point  of  the  standard  normal  distribu- 
tion. For  example,  we  might  be  interested  in  a 95%  confidence  inter- 
val, which  corresponds  to  a value  of  z - 1.96.  In  the  sample  of 
100,000  above,  this  yields 

[ .02653  ± .00099 ] (6) 

For  other  confidence  regions,  and  for  sample  runs  of  words  of  differ- 
ent lengths,  see  figure  2. 


8The  only  change  is  to  use  a special  version  of  the  random_unit  sub- 
routine (which  is  user-supplied)  that  supplies  units  of  a known  word 

rather  than  random  units. 


27 


confidence 
word  length  range 

number 

minimum 

of  words 
maximum 

99.9$ 

qq« 

6 letters 

9o$ 

90% 

1.745x10? 

1.761x101 

1.770x10; 

1.782x10' 

1 . 896x 1 0l 

1 .880x10? 
1 .867x10? 
1.858x10' 

99.9% 

8 letters  95% 

90$ 

5. 191x109 
5.269x109 
5. 331x109 
5.363x109 

5.889x109 
5 . 8 1 2x  1 09 
5.787x109 
5.714x109 

99.9$ 

99$ 

10  letters 

90% 

1.4b4x1012 
1 . 499x1 01 2 
1.535x10  ; 
1. 546x10 12 

1.794x1012 
1.759x10  2 
1.735x10  2 
1 . 7 1 2x1 01 2 

Figure  2.  Number  of  Words  of  6,  8 and  10  Letters 


PROBABILITY  OF  A WORD 

The  words  produced  by  the  random  word  generator  are  not  all 
equally  probable  for  two  reasons.  First,  different  units  have  differ- 
ent probabilities  of  being  generated  by  random_unit.  Second,  not  all 
units  thus  generated  are  always  acceptable.  The  probability  of  a giv- 
en word  must  be  calculated  by  examining  the  conditional  probabilities 
of  the  individual  units  in  that  word. 

Since  random  words  are  created  left  to  right,  at  a given  point 
during  the  creation  of  a word  the  units  accepted  so  far  determine 
which  units  may  follow.  Thus  the  probability  of  a particular  unit  ap- 
pearing in  a particular  position  of  a word  is  the  ratio  of  that  unit's 
probability  (of  being  returned  by  random_unit)  to  the  total  probabili- 
ty of  all  the  units  that  are  legal  in  that  position.  This  calculation 
can  be  made  for  each  unit  based  only  on  the  units  that  precede  it.  In 
order  to  calculate  the  probability  of  a particular  word,  the  probabil- 
ities of  the  individual  units  in  that  word  are  determined  in  this  man- 
ner and  then  multiplied  together. ? 


^The  100-try  limit  discussed  on  page  15  may  cause  entire  words  to  be 
rejected  even  though  some  units  were  accepted.  However,  test  runs 
have  shown  that  the  100-try  limit  is  almost  never  reached. 
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The  method  described  above  works  because,  for  each  position  of 
the  word,  the  random  word  generator  keeps  trying  random  units  until  a 
legal  unit  is  found.  The  unacceptable  units  play  no  part  in  the  prob- 
ability that  a particular  legal  unit  will  appear.  For  example, 
suppose  in  a given  position  of  a particular  word  the  only  legal  units 
are  "e"  and  "a".  If  it  is  known  that  the  probability  of  "e"  appearing 
at  random  is  .057,  and  the  probability  of  "a"  is  .047,  then  the  proba- 
bility that  the  unit  will  be  an  "e"  is  .057/ ( .057+. 047) . 

Since  the  random  word  generator  does  not  throw  out  a unit  once  it 
has  been  accepted,  it  is  merely  necessary  to  multiply  the  individual 
conditional  unit  probabilities  together  to  arrive  at  the  probability 
of  the  word.  Note  that  this  probability  only  applies  to  words  of  a 
given  length  (i.e.,  the  length  of  the  word  whose  probability  is  calcu- 
lated). The  random  word  generator  does  not  pick  a length,  but  is 
asked  to  generate  a word  of  a specified  length.  If  random  lengths  are 
supplied  to  the  random  word  generator,  the  distribution  of  these  ran- 
dom lengths  must  be  figured  into  the  probability  of  the  word  calculat- 
ed . 


The  special  program  described  in  the  previous  subsection  that 
"gives"  the  random  word  generator  known  words  was  modified  to  calcu- 
late the  probability  of  the  known  word  in  the  manner  described.  The 
answer  is  exact  in  most  cases,  and  the  method  will  work  regardless 
of  the  definition  of  the  units,  the  tables,  or  the  nature  of  the  algo- 
rithm. The  only  restriction  is  that  the  word  generator  not  discard 
units  that  have  already  been  accepted  in  a given  position  of  a word, 
and  that  the  distribution  of  the  units  returned  by  randoinjjnit  remain 
constant  during  the  formation  of  the  word. 


MOST  PROBABLE  WORD 

The  "most  probable"  word  (or  words)  and  its  probability  as  deter- 
mined in  the  above  manner  is  meaningful  to  those  interested  in  the 
difficulty  of  guessing  random  words.  In  the  password  application,  for 


Q 

Some  words  can  be  divided  into  units  in  one  of  two  ways.  For  exam- 
ple, "w-i-t-h-o-u-t"  and  "w-i-th-o-u-t"  are  two  ways  of  specifying  the 
units  of  "without",  both  of  which  are  legal.  An  exact  calculation  of 
the  probability  of  this  word  would  require  adding  the  probabilities  of 
both  forms.  In  general,  however,  the  probability  of  the  version  that 
contains  more  units  (i.e.,  "w-i-t-h-o-u-t")  is  much  lower  because  of 
the  extra  unit,  and  thus  makes  little  difference  in  the  total  proba- 
bility of  the  word.  In  calculating  probabilities  of  words  containing 
two-letter  units  that  may  possibly  be  split  into  two  one-letter  units 
the  calculation  is  based  on  the  word  with  the  two-letter  units. 
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example,  it  does  not  matter  if  there  are  one  billion  random  words  if 
the  most  probable  word  has  a probability  as  high  as  50$  (even  though 
the  probability  of  all  other  words  may  be  small).  A systematic  method 
for  guessing  a particular  user's  password  would  be  to  first  try  the 
most  probable  word  and  work  down  from  there.  If  the  first  word  tried 
has  a high  probability,  a large  set  of  legal  words  is  of  little  value. 

As  important  as  this  statistic  is,  it  appears  that  only  an  ex- 
tremely complex  analysis  will  yield  the  most  probable  word.  The  obvi- 
ous method  of  selecting  only  the  most  probable  units  to  form  a proba- 
ble word  does  not  work.  For  example,  two  of  the  most  probable  units 
are  "e"  and  "t".  One  might  expect  that  the  most  probable  six  letter 
word  is  something  like  "teetee" . Actually,  a word  like  "heehee”  is 
almost  twice  as  probable.  A simple  calculation  can  show  that  the 
first  two  units  of  a word  are  much  more  likely  to  be  "he"  than  "te". 
Sven  though  "t”  has  twice  the  probability  of  being  first,  the  set  of 
legal  units  following  ”t"  is  greater  than  the  set  of  units  following 
"h".  It  turns  out  that  with  the  tables  in  use  there  are  only  six 
units  that  may  follow  f,hM , whereas  there  are  eight  that  may  follow 
”t”.  The  probability  that  one  of  those  six  will  be  "e"  is  fairly 
high.  The  low  probability  of  "h"  mutiplied  by  the  high  probability  of 
”e"  yields  a value  greater  than  the  probability  of  the  pair  "te". 

An  empirical  approach  to  arriving  at  the  most  probable  word  might 
be  to  generate  a large  number  of  random  words  and  to  calculate  their 
probabilities.  Unfortunately,  even  the  most  probable  word  may  have  a 
probability  sufficiently  low  so  that  millions  of  words  might  be  gener- 
ated before  the  most  probable  word  appears.  Moreover,  one  would  have 
no  assurance  that  any  particular  word  really  is  the  most  probable. 

Once  more,  intuition  and  a "feeling”  of  the  rules  and  restric- 
tions of  the  algorithm  were  relied  upon.  The  utility  programs  previ- 
ously described  made  it  easy  to  try  many  expected  high-probability 
words  manually.  In  this  way,  a guess  of  the  highest  probability  words 
of  6,  7,  8 and  10  letters  has  been  made.  The  words  are  listed  below, 
along  with  their  probabilities.  There  may  actually  be  several  words 
of  each  length  with  the  same  probability.  The  results  below  only  ap- 
ply if  the  specific  digram  and  unit  tables  listed  in  Appendix  I are 
used,  and  if  the  distribution  of  the  units  is  as  listed  in  that  appen- 
dix. 


^One  should  also  consider  that  the  use  of  two-letter  units  increases 
the  probability  of  certain  words.  The  six-letter  word  "quequo"  is  an 
order  of  magnitude  more  probable  that  "teetee”,  because  it  actually 
only  contains  four  units  (qu-e-qu-o),  even  though  the  probability  of 
"qu"  coming  from  the  random_unit  is  very  low. 
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word 

P 

1/p 

quethe 

2.45x10~6 

408,000 

squequo 

1 ,64x10-7 

6,098,000 

queshquo 

2.19x10~8 

45,662,000 

queshquesh 

1 .8lx10-10  5 

,525,000,000 

The  probabilities  above  only  apply  to  words  of  the  specific  length 
shown.  For  example,  if  the  word  generator  is  asked  to  generate  words 
of  a random  length  of  6,  7 or  8 letters,  and  each  length  is  equally 
likely,  then  the  probabilities  above  are  multiplied  by  1/3.  Of 
course,  the  probability  of  the  six  letter  word  is  so  high  that  the 
other  two  words  are  of  little  interest  if  six  letter  words  are  al- 
lowed . 


DISTRIBUTION  OF  PROBABILITIES 

The  ability  to  calculate  the  probability  of  a given  word,  and  the 
total  number  of  words  allows  us  to  arrive  at  an  approximate  distribu- 
tion of  the  probabilities  of  the  pronounceable  words,  from  most  proba- 
ble to  least  probable.  This  distribution  yields  a kind  of  profile  of 
the  word  generator  that  may  be  the  best  overall  measure  of  the  word 
generator's  effectiveness.  One  method  of  arriving  at  such  a distribu- 
tion is  outlined  below.  As  with  the  number  of  words,  the  accuracy  of 
the  distribution  curve  depends  on  the  size  of  the  sample  of  random 
words  used. 

Assume  that  all  n pronounceable  words  of  length  L are  listed  in 
order  of  probability,  and  that  a "word  number"  x,  0 running  from  1 
to  n,  is  assigned  to  each  word,  where  x = 1 for  the  most  probable 
word.  Let  p(x)  be  the  probability  of  word  x.  If  we  had  all  n words, 
we  could  plot  x against  p(x)  as  in  figure  3 to  obtain  a series  of 
points.  The  distribution  p(x)  will  be  loosely  referred  to  as  a 
"curve"  although  strictly  it  is  not  a continuous  function.  Of  course, 
p(x)  is  monotonically  non-increasing  by  definition.  The  area  under 


^The  letter  "x"  has  been  chosen  for  the  word  number  instead  of  the 
more  obvious  choice  of  "i"  to  represent  an  integer  in  order  to  be  more 
consistent  with  the  notation  generally  used  for  some  of  the  calcula- 
tions in  the  following  pages  that  treat  x as  a continuous  variable. 
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I A—  45 ,671 


the  curve  is  unity,  or  more  precisely 


n 

P(  x ) = i.  (7) 

x=  1 


Once  the  curve  is  obtained,  quantities  like  the  total  probability  of 
the  m most  probable  words,  the  probability  of  duplicates  within  a cer- 
tain number  of  tries,  etc.,  can  be  calculated  or  measured  from  a graph 
of  the  curve. 


X 


Figure  3.  Distribution  of  Probabilities  of  Random  Words 


Determination  of  Distribution 


If  all  n pronounceable  words  (and  probabilities)  were  available, 
producing  p(x)  exactly  would  be  no  problem.  In  reality,  we  can  only 
obtain  a certain  fraction  of  the  n words.  If  we  could  in  some  way  se- 
lect every  millionth  word  in  the  ordered  list  of  n words,  we  could 
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still  estimate  a curve  of  p(x)  by  merely  plotting  every  millionth 
point  in  figure  3 and  interpolating  to  get  the  values  in  between.  The 
accuracy  of  such  a plot  will  depend  on  the  "smoothness”  of  p(x)  in 
some  sense  (and  of  course  on  the  method  used  to  make  the  interpola- 
tion) . 

There  is  no  direct  way  to  arrive  at  every  millionth  word  in  the 
list.  We  can  generate  k random  words  but  we  have  no  way  of  knowing 
what  their  positions  are  in  the  list  (i.e.,  their  values  of  x).  In 
fact,  if  we  generate  k random  words,  their  values  of  x will  not  be 
evenly  distributed  in  the  interval  [1,  n],  but  will  be  weighted  to- 
wards the  lower  end  since  the  words  of  higher  probability  are  more 
likely  to  appear  at  random.  It  is  possible,  however,  to  pick  a random 
subset  of  the  n words  that  is.  evenly  distributed  in  the  interval. 

The  uniform  random  word  generator  discussed  near  the  top  of  page 
21  can  be  used  to  provide  a large  enough  set  of  equally  likely  random 
words  so  that  the  desired  number  k of  these  will  be  pronounceable. 

The  k words  thus  obtained  can  be  assumed  to  be  equally  spaced  in  the 
interval  [1,  n]  because  they  were  arrived  at  in  a manner  totally  inde- 
pendent of  their  probabilities.  That  is  to  say,  the  least  probable  of 
the  k words  has  just  as  high  a probability  of  appearing  (using  the 
uniform  generator)  as  does  the  most  probable  word. 

An  approximate  graph  of  p(x)  was  obtained  by  taking  the  2653  pro- 
nounceable words  used  to  estimate  the  value  of  n in  (4)  and  ordering 
them  according  to  probability..  Each  word  was  assigned  an  index  i, 

i = 1 , 2,  . . . , k,  (8) 

where  i = 1 for  the  most  probable  of  these  words.  For  each  word,  the 
position  on  the  x-axis  was  determined  by 

in 

x ( i ) = . (9) 

k + 1 

A plot  of  the  probabilities  of  the  2653  words  is  shown  in  figure  4. 
Application  of  the  Distribution 

Figure  4 is  a complete  profile  of  the  word  generator  and  it  can 
be  used  to  measure  various  quantities.  For  example,  the  total  proba- 
bility of  the  m most  probable  words  is  simply  the  area  under  the  curve 
from  x = 0 to  x = m.  The  number  of  words  that  make  up  any  given  frac- 
tion of  the  population  can  also  easily  be  measured. 

Remember  that  in  figure  4 the  value  of  x is  actually  the  "word 
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Figure  4.  Distribution  of  Probabilities  of  2653  Eight  Letter  Words 
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number”  where  x = 1 for  the  most  probable  word  and  x = n for  the  least 
probable.  The  value  of  x(i)  for  i = 1 in  our  sample  of  2653  pro- 
nounceable words  has  a value  of  approximately  2,000,000  as  calculated 
by  (9).  The  most  probable  eight  letter  word  out  of  the  entire  popula- 
tion is  of  course  at  x = 1 . If  we  can  believe  for  a moment  that  fig- 
ure 4 is  an  exact  representation,  we  can  enlarge  the  extreme  leftmost 
end  of  the  curve  where  x is  small  as  in  figure  5,  and  extrapolate  to 
the  left  of  the  point  at  x = 2,000,000  to  double  check  the  determina- 
tion of  the  most  probable  word  on  page  25.  Of  course  this  extrapola- 
tion is  not  mathematically  valid  since  there  is  no  sound  basis  for  as- 
suming that  the  curve  continues  in  any  specific  pattern.  However,  it 
does  appear  that  extrapolation  yields  a value  of  the  most  probable 
word  very  close  to  that  obtained  by  trial  and  error. 

Another  check  on  the  distribution  curve  can  be  made  by  measuring 
the  area  under  the  curve.  In  order  to  approximately  calculate  this 
area,  Simpson's  rule  was  used  where  the  first  point  (at  x = 1)  was  as- 
sumed to  be  the  most  probable  word  as  previously  determined,  and  suc- 
cessive points  are  at  intervals  of  n/2654.  The  area  thus  calculated 
came  out  to  1.006,  only  0.6?  off  the  expected  value  of  1.000. 

Figures  4 and  5 apply  only  to  a specific  sample  run  for  eight 
letter  words.  Appendix  V presents  similar  data  for  six  and  ten  letter 
words  as  a comparison.  Of  course,  a different  digram  table  or  unit 
table  would  greatly  change  these  distributions. 


AN  ALTERNATIVE  METHOD 

The  main  difficulty  in  the  analysis  of  the  random  word  generator 
lies  in  the  complexity  of  the  algorithm.  The  nature  of  the  algorithm 
is  such  that  a highly  asymmetric  distribution  of  probabilities  of 
words  results,  with  some  words  being  many  orders  of  magnitude  more 
probable  than  others.  The  goal  of  the  preceding  analysis  was  to  pro- 
vide information  as  to  the  shape  of  the  probability  distribution  curve 
so  that  the  word  generator's  suitability  for  any  particular  applica- 
tion could  be  examined. 

In  its  application  as  a Multics  password  generator,  the  results 
of  figure  4 may  indicate  that  the  word  generator  is  not  suitable  for 
passwords  due  to  the  high  probability  of  the  words  at  the  leftmost  end 
of  the  curve.  Some  installations  may  need  passwords  that  have  a prob- 
ability less  than  2.19x10“  . It  is  possible  to  improve  this  proba- 
bility by  changing  the  digram  and  unit  tables  and  the  distribution  of 
the  units  returned  by  the  random_unit_  subroutine,  but  it  is  very  dif- 
ficult to  anticipate  the  effect  of  any  particular  change  on  the  proba- 
bility distribution.  Once  the  change  is  made  in  the  tables,  there  is 
no  easy  way  to  determine  what  the  most  probable  word  actually  is. 
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PROBABILITY 
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There  is  an  alternative  method,  however,  that  can  be  employed  without 
changing  any  of  the  tables,  that  yields  a distribution  that  is  much 
easier  to  determine. 


In  order  to  illustrate  this  alternative  method,  assume  that  we 
wish  to  improve  the  word  generator's  distribution  so  that  no  word  is 
more  probable  than  a word  composed  of  six  random  letters.  This  exam- 
ple was  chosen  because  the  six  random  letter  criterion  for  passwords 
is  applied  to  several  systems  in  use  today.  This  criterion  translates 
to  a maximum  probability  of 


has  a probability  of  2.1^xiu  — too  nign  Dy  a factor  of  seven,  and 

Multics  does  not  allow  (nor  would  it  be  desirable  considering  the  re- 
memberability  requirement)  passwords  longer  than  eight  characters. 
Note,  however,  that  the  total  number  of  pronounceable  eight  letter 
words  from  equation  (4)  is  much  greater  than  the  total  number  of  ran- 
dom six  letter  words.  Thus,  if  there  were  some  way  to  force  the  word 
generator  to  generate  all  n pronounceable  words  with  equal  probabili- 
ty, then  the  probability  of  any  particular  word  would  be  1/n  and  the 
analysis  would  be  trivial. 

By  utilizing  the  random  word  generator  in  a slightly  different 
way,  at  an  additional  cost  in  overhead,  it  is  possible  to  force  the 
probabilities  of  all  words  to  be  equal  without  changing  the  total  num- 
ber of  words.  Consider  the  method  used  to  obtain  the  estimated  value 
of  n in  equation  (4).  This  value  was  obtained  by  generating  words  at 
random  such  that  all  eight  letter  words  are  equally  likely,  and  com- 
puting the  fraction  of  those  that  were  pronounceable.  A method  for 
generating  equally  probable  pronounceable  words,  then,  is  to  generate 
equally  likely  random  words  and  test  them  for  pronounceability  until 
an  acceptable  word  turns  up.  All  acceptable  words  thus  generated  are 
equally  probable,  and  the  randomness  criterion  is  satisfied.  Appendix 
VII  contains  the  source  code  and  the  documentation  for  the  two  program 
modules  that  have  been  altered  in  order  to  optionally  produce  uniform- 
ly distributed  random  words.  In  addition,  for  those  interested,  a 
listing  of  the  Multics  encipher^  subroutine  is  included.  This  is  the 
subroutine  used  to  generate  random  numbers. 

The  question  that  arises  is  whether  this  "sampling"  method  is 
feasible  considering  the  possible  additional  computer  time  required 
for  testing  and  rejecting  words.  The  answer  depends  on  the  fraction 
of  words  that  are  accepted,  and  whether  it  is  more  or  less  expensive 


— - = 3.24  x 10-9 
266 


(10) 


for  any  word.  The  most 


word  shown  on  page  25 
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to  test  a word  for  pronounceability  rather  than  to  generate  it. 

The  fraction  of  pronounceable  words  was  determined  on  page  21  for 
eight  letter  words  using  specific  digram  and  unit  tables.  The  value 
.02653  is  an  acceptance  ratio  of  approximately  1 in  37-  Thus  we  would 
expect,  on  the  average,  to  make  37  tries  before  getting  an  acceptable 
word . 


The  time  required  to  generate  pronounceable  eight  letter  words 
with  the  word  generator  is  somewhat  greater  than  the  time  required  to 
generate  a word  at  random  and  to  test  its  pronounceability.  This  is 
because  the  word  generator  does  not  accept  every  unit  supplied  to  it 
by  random_unit  in  the  word  being  formed,  whereas  all  units  of  a pro- 
nounceable word  to  be  tested  are  immediately  accepted.  The  time  re- 
quired to  test  an  unpronounceable  word  is  usually  less  than  the  time 
required  to  test  a pronounceable  word  because  the  whole  word  is  rejec- 
ted as  soon  as  an  illegal  unit  is  encountered.  If  we  expect,  on  the 
average,  to  make  37  tries  to  find  a pronounceable  word,  we  would  ex- 
pect the  time  required  to  do  this  to  be  no  more  37  times  that  required 
to  generate  one  pronounceable  word.  This  is  born  out  by  empirical  ev- 
idence indicating  that  the  average  time  required  to  find  a pronounce- 
able eight  letter  word  by  trial  and  error  is  about  10  times  that  re- 
quired to  generate  one  pronounceable  word.  In  Multics  computer  time, 


the  figures  are  about  .10  second  and  .01  second  per  word 
tively . 
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respec- 


Since  the  number  of  pronounceable  words  is  much  greater  (by  a 
factor  of  18)  than  that  required  to  fulfill  the  six  random  letter  cri- 
terion, it  may  be  possible  to  significantly  increase  the  pronouncea- 
bility of  words  generated  by  modifying  the  tables  to  yield  a smaller 
set  of  possible  words.  Consider  the  possible  results  if,  for  example, 
one  could  delete  17  out  of  every  18  words  in  the  list  in  Appendix  IV 
and  save  only  the  most  pronounceable  ones.  Of  course,  by  restricting 
the  rules  further,  the  "acceptance  ratio"  of  1 in  37  is  decreased, 
thereby  resulting  in  additional  time  spent  finding  a pronounceable 
word.  A decrease  by  a factor  of  18  will  increase  the  average  time  of 
.10  seconds  for  finding  a pronounceable  eight  letter  word  to  several 
seconds.  This  may  not  be  tolerable  for  some  installations. 


Thus,  particularly  when  employing  the  alternative  sampling  method 
for  generating  pronounceable  words,  one  must  weigh  the  advantages  of 


1 The  time  of  .10  second  for  the  sampling  method  was  obtained  by 
using  a modified  version  of  the  random  word  generator  that  discards  a 
partial  word  and  starts  over  any  time  a unit  is  rejected.  In  this  way 
extra  (possibly  unused)  units  are  not  generated  as  in  the  case  when  a 
whole  random  word  is  first  created  before  testing. 
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pronounceability  against  overhead  in  computer  time  in  determining  what 
modifications  to  make  to  the  tables.  The  advantage  of  using  the  sam- 
pling method  is  that  the  only  statistical  quantity  to  be  determined, 
the  total  number  of  words,  is  fairly  easily  estimated.  The  effect  of 
a change  in  the  digram  table  on  this  estimate  can  be  quickly  examined. 
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SECTION  V 


CONCLUSION 


EVALUATION 

The  random  word  generator  described  in  this  report  has  been  suc- 
cessfully implemented  and  demonstrated  on  the  Multics  system.  The 
list  of  2000  random  words  contained  in  Appendix  IV  was  shown  to  vari- 
ous people,  and  it  became  apparent  that  the  degree  to  which  words  are 
considered  pronounceable  varies  a great  deal  among  individuals.  Some 
people  had  many  more  complaints  than  others  about  particular  words, 
in  most  cases,  complaints  were  about  words  containing  certain  combina- 
tions of  units  that  the  individual  did  not  consider  to  be  "legal". 

For  the  most  part,  combinations  considered  illegal  could  be  di- 
rectly eliminated  using  the  rules  of  the  digram  table.  In  other  much 
less  frequent  cases,  eliminating  the  offending  construct  was  either 
impossible  or  would  result  in  eliminating  many  more  constructs  that 
should  be  legal.  As  mentioned  earlier,  there  were  many  fewer  com- 
plaints about  the  shorter  words  of  four  to  six  letters  than  about  the 
longer  ones. 

The  statistics  discussed  in  Section  IV  and  Appendix  V,  if  pre- 
senting an  unsatisfactory  performance  for  a particular  application, 
can  be  improved  by  modifying  the  digram  table  and  unit  table  or  by  al- 
tering the  manner  in  which  the  random  word  generator  is  utilized  as 
discussed  near  the  end  of  Section  IV.  For  example,  by  eliminating  all 
double-letter  units,  the  most  probable  word  will  have  a much  lower 
probability  than  that  indicated  on  page  25.  This  is  done,  however,  at 
the  cost  of  a reduction  in  the  total  number  of  different  words.  Or, 
at  an  increase  in  overhead,  the  alternative  method  for  generating 
words  discussed  on  page  29  can  be  employed.  By  modifying  the  rules  in 
the  digram  table,  particular  statistical  properties  can  be  adjusted, 
but  one  must  be  aware  of  the  interrelationships  between  the  various 
properties  and  performance  features  before  changing  the  tables  to 
achieve  a given  result. 

In  conclusion  it  appears  that  the  tables  that  are  input  to  the 
word  generator  and  the  manner  in  which  it  is  used  could  be  tailored 
for  almost  any  application,  whether  the  main  interest  is  in  pronounce- 
ability  or  performance. 
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OTHER  APPLICATIONS 


Pronounceability  and  randomness  were  the  primary  goals  of  the 
word  generator  in  its  use  for  generating  random  words.  However,  the 
support  program  discussed  in  Section  IV  that  gives  the  word  generator 
a word  to  be  tried,  combined  with  the  word  generator's  ability  to  di- 
vide a word  into  syllables,  partially  supports  the  facility  of  a gen- 
eral purpose  word  hyphenator  for  text  processing.  The  random  word 
generator  can  be  given  any  word  for  hyphenation.  If  the  word  is  ac- 
cepted, the  word  will  be  returned  hyphenated  into  syllables  just  as  if 
it  had  been  randomly  generated.  If  rejected,  the  word  is  illegal  ac- 
cording to  random  word  generator  rules  and  the  tables.  Of  course,  the 
hyphenation  will  not  always  be  the  "right”  one  according  to  the  dic- 
tionary, but  exceptions  are  apt  to  be  few.  Perhaps  some  pre-  or 
post-processing,  combined  with  a list  or  dictionary  of  exceptional 
constructs,  can  yield  a word  hyphenator  of  general  utility. 


1 ? 

An  example  of  a common  exception  is  found  in  words  containing 
"tion",  which  is  hyphenated  "ti-on"  as  two  syllables.  This  and  many 
other  problems  could  be  avoided  by  defining  new  kinds  of  double  letter 
units  and  changing  the  tables.  Such  modifications  may  make  the  word 
generator  unsuitable  for  generating  random  words  because  many  unpro- 
nounceable words  may  be  considered  legal.  However,  when  used  as  a hy- 
phenator the  legality  of  a word  is  of  no  concern. 
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APPENDIX  I 


TABLES 


The  following  pages  list  the  unit  table  and  digram  table  as  de- 
scribed in  Section  III.  The  unit  table  appears  first.  Each  entry  in 
the  unit  table  has  the  following  format: 

n cc  wxyz 

where : 

n is  a unit  number  (1  to  34). 
cc  is  the  unit  ( 1 or  2 letters). 

w is  0 or  1,  representing  the  value  of  "not_begin_syllable" . 
x is  the  value  of  r,no_finalj split11 . 
y is  "vowel", 

z is  "alternate_vowel". 

The  digram  table  follows,  with  each  entry  of  the  following  format: 
abd-cc+ef 

where : 

a is  0 or  1,  which  is  the  value  of  "begin", 
b is  "not_begin". 

d is  "break". 

appears  if  "prefix"  is  set,  otherwise  blank, 
cc  is  a pair  of  units  (2,  3,  or  4 letters). 

+ appears  if  "illegal_pair"  is  set.  If  it  is  that  means 

"suffix"  is  set.  Otherwise  it  is  blank, 
e is  "end", 

f is  "not  end". 
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The  Unit  Table 


1 

a 

0010 

10 

j 

0000 

19 

t 

0000 

28 

ph 

0000 

2 

b 

0000 

11 

k 

0000 

20 

u 

0010 

29 

rh 

0000 

3 

c 

0000 

12 

1 

0000 

21 

V 

0000 

30 

sh 

0000 

4 

d 

0000 

13 

m 

0000 

22 

w 

0000 

31 

th 

0000 

5 

e 

0110 

14 

n 

0000 

23 

X 

1000 

32 

wh 

0000 

6 

f 

0000 

15 

0 

0010 

24 

y 

0001 

33 

qu 

0000 

7 

g 

0000 

16 

P 

0000 

25 

z 

0000 

34 

ck 

1000 

8 

h 

0000 

17 

r 

0000 

26 

ch 

0000 

9 

i 

0010 

18 

3 

0000 

27 

gh 

0000 
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The  Digram  Table 


000 

aa 

+00 

01  1 

bm 

01 

Oil 

cz 

01 

000 

ec 

00 

000 

fo 

00 

000 

ab 

00 

Oil 

bn 

01 

000 

cch+OO 

000 

ed 

00 

01 1 

fp 

01 

000 

ac 

00 

000 

bo 

00 

000 

cgh+OO 

000 

ee 

00 

100 

fr 

01 

000 

ad 

00 

01  1 

bp 

01 

01  1 

eph 

01 

000 

ef 

00 

010 

fs 

00 

000 

ae 

+00 

100 

br 

01 

000 

erh+OO 

000 

eg 

00 

010 

ft 

00 

000 

af 

00 

010 

bs 

00 

01 1 

esh 

01 

Oil 

eh 

01 

000 

fu 

00 

000 

ag 

00 

Oil 

bt 

01 

01  1 

cth 

01 

000 

ei 

01 

01  1 

fv 

01 

011 

ah 

01 

000 

bu 

00 

000 

cwh+OO 

000 

ej 

00 

01  1 

fw 

01 

000 

ai 

00 

01  1 

bv 

01 

010 

cqu- 

-01 

000 

ek 

00 

000 

fx  ■ 

+00 

000 

aj 

00 

Oil 

bw 

01 

000 

cck+OO 

000 

el 

00 

010 

fy 

00 

000 

ak 

00 

000 

bx  - 

<-00 

000 

da 

00 

000 

em 

00 

Oil 

fz 

01 

000 

al 

00 

000 

by 

00 

Oil 

db 

01 

000 

en 

00 

01  1 

fch 

01 

000 

am 

00 

Oil 

bz 

01 

Oil 

dc 

01 

001 

eo 

00 

01  1 

fgh 

01 

000 

an 

00 

Oil 

bch 

01 

010 

dd 

00 

000 

ep 

00 

01  1 

fph 

01 

000 

ao 

+00 

000 

bgh+OO 

000 

de 

00 

000 

er 

00 

000 

frh+OO 

000 

ap 

00 

Oil 

bph 

01 

Oil 

df 

01 

000 

es 

00 

Oil 

fsh 

01 

000 

a r 

00 

000 

brh+OO 

01  1 

bg 

01 

000 

et 

00 

Oil 

fth 

01 

000 

as 

00 

01  1 

bsh 

01 

01  1 

dh 

01 

000 

eu 

00 

000 

fwh+OO 

000 

at 

00 

01  1 

bth 

01 

000 

di 

00 

000 

ev 

00 

Oil 

fqu 

01 

000 

au 

00 

000 

bwh+OO 

Oil 

bj 

01 

000 

ew 

00 

000 

fck+OO 

000 

av 

00 

Oil 

bqu 

01 

Oil 

dk 

01 

000 

ex 

00 

000 

ga 

00 

000 

aw 

00 

000 

bck+OO 

01 1 

dl 

01 

000 

ey 

00 

01  1 

gb 

01 

000 

ax 

00 

000 

ca 

00 

01  1 

dm 

01 

000 

ez 

00 

01  1 

gc 

01 

000 

ay 

00 

Oil 

cb 

01 

01  1 

dn 

01 

000 

ech 

00 

Oil 

gd 

01 

000 

az 

00 

Oil 

cc 

01 

000 

do 

00 

01  1 

egh 

01 

000 

ge 

00 

000 

ach 

00 

Oil 

cd 

01 

01  1 

bp 

01 

000 

eph 

00 

01  1 

gf 

01 

000 

agh+OO 

000 

ce 

00 

100 

dr 

01 

000 

erh+OO 

010 

gg 

00 

000 

aph 

00 

01 1 

cf 

01 

010 

ds 

10 

000 

esh 

00 

Oil 

gh 

01 

000 

arh+OO 

Oil 

eg 

01 

Oil 

dt 

01 

000 

eth 

00 

000 

gi 

00 

000 

ash 

00 

01  1 

ch 

01 

000 

du 

00 

000 

ewh+OO 

Oil 

gj 

01 

000 

ath 

00 

000 

ci 

00 

Oil 

dv 

01 

001 

equ 

01 

000 

gk 

+00 

000 

awh+OO 

01  1 

cj 

01 

01  1 

dw 

01 

000 

eck 

00 

100 

gl 

-01 

001 

aqu 

01 

01  1 

ck 

01 

000 

dx  +00 

000 

fa 

00 

01  1 

gm 

01 

000 

ack 

00 

000 

cl  ■ 

-01 

000 

dy 

00 

Oil 

fb 

01 

01  1 

gn 

01 

000 

ba 

00 

01  1 

cm 

01 

01  1 

dz 

01 

01  1 

fc 

01 

000 

go 

00 

Oil 

bb 

01 

01  1 

cn 

01 

Oil 

dch 

01 

Oil 

fd 

01 

01  1 

gP 

01 

Oil 

be 

01 

000 

CO 

00 

01  1 

dgh 

01 

000 
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APPENDIX  II 


This  appendix 

RANDOM  WORD  ALGORITHM 

lists  the  algorithm  described  on  page  15.  Below  is 

a description  of  the  notation  and  the  variables  used  to  describe  a 


state . 

State  Variables 

Each  loop  through  the  algorithm  produces  new  values  of  several 
ctate  variables  and  possibly  adds  a unit  to  the  random  word  being 
formed.  The  variables  used  to  describe  the  state  are  defined  as  fol- 
lows. "Binary"  variables  may  have  the  value  "true"  or  "false";  "dec- 
imal" variables  have  a number  as  their  value. 


vowel_found 

Set  when  a vowel  is  found  in  a syllable  (binary). 

last_vowel_found 

Value  of  vowel_found  for  previous  unit  in  the  ran- 
dom word  (binary). 

syllable_length 

Number  of  units  in  syllable  so  far  (decimal,  ini- 
tially 1). 

index 

Number  of  units  in  word  so  far  (decimal,  initially 
1). 

cons_count 

Number  of  consecutive  consonants  (decimal) . 

nchars 

Number  of  letters  in  word  to  be  generated.  Ini- 
tially this  is  set  to  the  length  of  the  word  (in 
letters)  desired.  This  value  is  decremented  each 
time  a two-letter  unit  is  generated  so  that  the 
number  of  units  (index)  can  be  compared  to  nchars 
to  determine  if  the  end  of  the  word  has  been 
reached . 

unit( 1),  unit(2),  , 

, . . unit(index) 

Unit(i)  represents  the  i'th  unit  in  the  word. 
Unit(index)  is  the  current  unit. 

In  addition  to  the  state  variables,  two  variables  are  defined  for 
use  only  internal  to  the  algorithm.  They  are  used  to  simplify  the  no- 
tation. 
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v A binary  variable  which  is  set  when  the  unit  just  generated  is  a 
vowel  (or  an  alternate_vowel  to  be  treated  as  a vowel)* 

b A binary  variable  which  gets  set  when  a "break"  pair  (as  defined  on 
page  12)  is  encountered,  or  when  the  previous  pair  was  a "suffix" 
pair  and  the  current  unit  is  not  a vowel. 

Notation 

The  following  names  are  used  in  the  algorithm  for  the  eight  flags 
in  the  digram  table: 

begin, 
not_begin, 
end , 

not_end, 
break , 
prefix , 
suffix, 

and  illegal_pair . 

If  one  of  these  names  appears  with  a value  in  parentheses  immediately 
following  it,  as  "break(i)",  the  reference  is  to  the  "break"  flag  for 
the  pair  of  units  [unit(i-l),  unit(i)].  If  no  value  appears,  the  ref- 
erence is  to  the  pair  [unit(index-1 ) , unit(index)]  — that  is,  the 
reference  is  to  the  last  two  units. 

The  following  names  are  used  for  the  flags  in  the  unit  table: 

no_f inal_split , 
not_begin_syllable, 
vowel , 

alternate_vowel , 
and  double_letter . 

The  "double_letter"  flag  was  not  explicitly  mentioned  in  the  discus- 
sion earlier.  It  is  set  for  units  consisting  of  more  than  one  letter. 
A value  in  parentheses  following  the  name,  as  in  vowel(i),  refers  to 
the  vowel  flag  for  unit(i).  If  no  value  appears  the  reference  is  to 
the  flag  for  unit(index),  or  the  current  unit. 

Three  procedures  are  referred  to:  "random_unit"  is  a user-suppl- 

ied procedure  to  generate  a random  unit;  "random_vowel"  is  a user 
supplied  procedure  to  generate  a random  vowel  unit;  and  "done"  is  an 
internal  procedure  appearing  near  the  end  of  the  algorithm. 
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Algorithm 


The  algorithm  is  shown  in  the  following  pages.  The  text  of  the 
algorithm  is  essentially  the  same  as  the  main  body  of  the  random_word_ 
subroutine  appearing  in  Appendix  III.  The  algorithm  is  shown  here 
only  for  completeness  — it  can  stand  alone  and  does  not  depend  on  any 
support  subroutines.  Since  the  random_word_  subroutine  as  shown  in 
Appendix  III  is  well  documented  and  commented,  no  comments  are 
supplied  below.  A correspondence  between  this  algorithm  and  the 
random_word_  subroutine  can  easily  be  made. 

The  RANDOM_WORD  procedure  has  an  internal  procedure  DONE  listed 
near  the  end.  The  extents  of  the  if-then-else  clauses  are  indicated 
by  indentation. 


52 


RANDOM  WORD  ALGORITHM 


BEGIN  procedure  RANDOM_WORD 
retry: 

if  syllable_length  = 1 
then 

if  index  = nchars 
then  call  random_vowel 
else  call  random_unit 
if  (index  t 1 & illegal_pair ) 
then  go  to  retry 
syllable_length  = 2 
if  vowel  | alternate^ vowel 
then  cons_count  = 0 
else  cons_count  = 1 
last_vowel_found  = 0 
if  double_letter 
then 

if  index  = nchars  i (index  = nchars-1  & "vowel) 
then  go  to  retry 
else  nchars  = nchars  - 1 
else 

if  (syllable^ length  = 2 & "vowel_found  & index  = nchars) 
( "vowel__found  not_end( index- 1 ) ) & suffix(index-1 ) 
then  call  random_vowel 
else  call  random__unit 
if  illegal_pair  | 

(unit (index )=unit( index- 1 ) =unit (index-2)  & index>2) 
then  go  to  retry 

if  double_letter  & index  = nchars 

then  goto  retry 

else  nchars  = nchars  - 1 

if  vowel  ! (alternate_vowel  & "vowel(index-1 ) ) 
then  v = 1 
else  v = 0 

if  syllable_length  > 2 & suffix(index-1 ) & ~v 
then  b = 1 
else  b = break 

if  syllable^ length  = 2 & not_begin 
then  go  to  no  good 
if  vowel_found 
then 

if  cons_count  l 0 
then 

if  begin 
then 
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RANDOM  WORD  ALGORITHM 


(continued) 


if  syllable_length  i 3 & not_end(index-2) 
then 

if  not_end ( index- 1 ) 
then  go  to  no  good 
else  call  done(v,2) 
else  call  done(v,3) 
else 

if  not_begin 
then 
if  b 
then 

if  not_end ( index- 1 ) 
then  go  to  no  good 
else  call  done(v,2) 
else 
if  v 
then 

if  not_end ( index- 1 ) ! not_begin_syllable 
then  go  to  no  good 
else  call  done(1,2) 
else  call  done(~end,end) 

else 
if  v 
then 

if  not_end(index-2)  ! syllable_length  ~ = 3 
then 

if  not_end( index- 1 ) 
then 

if  cons_count  > 1 
then 

if  not_end( index-3)  I 
not_begin( index-1 ) 
then  go  to  no  good 
else  call  done(1,4) 
else  go  to  no  good 
else  call  done(1,2) 
else  call  done(1,3) 
else  call  done( 1,0) 

else 

if  v & vowel(index-2)  & index  > 2 
then  go  to  no  good 
else 
if  end 

then  call  done(0,1) 
else 
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RANDOM  WORD  ALGORITHM 


(continued ) 


if  begin 
then 

if  last_vowel_found 
then 
if  v 
then 

if  syllable_length  = 3 
then 

if  alternate_vowel(index-2) 
then  go  to  no  good 
else  call  done(1,3) 
else 

if  not_end(index-2) 
then  go  to  no  good 
else  call  done(1,3) 

else 

if  syllable_length  = 3 
then 

if  alternate_vowel(index-2) 
then  call  done( 1,3) 
else  go  to  no  Rood 
else 

if  not_end(index-2) 
then 

if  not_end(index-1 ) 
then  go  to  no  good 
else  call  done(0,2) 
else  call  done(1,3) 

else 

if  not_end(index-1 ) & syllable_length  > 2 
then  go  to  nogood 
else  call  done(v,2) 

else 
if  b 
then 

if  not_end(index-1 ) & syllable_length  > 2 
then  go  to  no  good 
else  call  done(v,2) 
else  call  done(1,0) 

else 
if  b 

then  go  to  no  good 
else 
if  end 
then 
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RANDOM  WORD  ALGORITHM 


(continued ) 


if  v 

then  call  done(0,l) 
else  go  to  no  good 
else 
if  v 
then 

if  begin  & syllable_length  > 2 
then  go  to  no  good 
else  call  done(1,0) 
else 

if  begin 
then 

if  syllable_length  > 2 
then  go  to  no  good 
else  call  done(0,3) 
else  call  done(0,0) 
go  to  retry 
no _jgood: 

if  double_letter  then  nchars  = nchars  + 1 
go  to  retry 


BEGIN  procedure  DONE: 

called  with  2 arguments:  call  done(vf,sl) 

if  si  i 2 & syllable_length  t 2 & prefix  & ~vowel(index-2) 
then 

if  vowel_found 
then 

if  not_end ( index- 1 ) 
then  go  to  no  good 
else 

call  done (0,2) 
return 

else  go  to  no  good 
else 

if  si  i 1 & index  = nchars  & (not_end  j vf  = 0) 
then  go  to  no  good 

if  index  = nchars  & no_final_split  & si  i 1 & “vowel ( index- 1 ) 
then 

if  “vowel_found  \ not_end( index- 1)  | syllable_length  < 3 

then  go  to  no  good 
else  si  = 0 
if  v ! si  = 1 
then  cons  count  = 0 
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RANDOM  WORD  ALGORITHM 


(concluded) 


else 

if  si  = 0 

then  cons_count  = cons_count  + 1 

else  cons_count  = min(syllable_length-1 , cons_count+1 ) 
if  si  = 0 

then  syllable_length  = syllable_length  + 1 

else  syllable_length  = si 

if  syllable_length  > 3 

then  last_vowel_found  = vowel_found 

else  last_vowel_found  = 0 

vowel_found  = vf 

return 

END  procedure  DONE 

END  procedure  RANDOM_WORD 
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APPENDIX  III 


SOURCE  CODE 


The  following  pages  contain  source  program  listings  of  every  com- 
mand and  subroutine  applicable  to  the  random  word  generator.  The 
listings  are  in  alphabetical  order  by  program  name.  Below  is  a list 
of  the  programs  with  a brief  description  of  the  function  of  each. 
Complete  documentation  of  the  usage  of  each  of  these  may  be  found  in 
Appendix  VI. 

In  Multics  there  is  a distinction  between  commands  and  subrou- 
tines. Commands  are  callable  by  the  user  from  his  terminal,  while 
subroutines  can  only  be  called  by  other  subroutines  or  commands.  The 
naming  convention  for  programs  specifies  that  subroutine  names  end 
with  a trailing  underscore,  while  command  names  should  not.  Generally 
the  commands  described  below  are  user  interfaces  to  specific  subrou- 
tines . 


columns  Prints  lists  of  random  words  in  columns. 

convert_word_  Subroutine  that  converts  an  array  of  unit  numbers 

to  characters.  Used  by  generate_word_. 

convert_word_char_  Subroutine  that  inserts  hyphens  into  a given  word 

and  formats  the  word  for  printing.  Used  by 
hyphen_test . 

digram_table_ compiler  Compiler  for  digram  table. 


generate_word. 


generate_words 
g e t_l  i n e_l  e n g t h 


hyphen_test 


Standard  interface  for  user-written  programs  to 
generate  a random  word.  Used  by  generate_words . 

Command  to  generate  a list  of  random  words. 

Command  and  subroutine  to  return  the  line  length 
of  the  output  medium  for  the  purposes  of  format- 
ting output.  Used  by  columns  and 
digram_table_compiler . Also  called 
get_line_length_. 

Command  to  hyphenate  a supplied  word  using  the 
rules  of  the  random  word  generator.  Also  used  to 
calculate  the  probability  of  a given  word. 
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hyphenate_ 

Subroutine  interface  to  perform  same  function  as 
hyphen_test.  Used  by  hyphen_test. 

random_unit_ 

Standard  subroutine  for  generating  a random  unit 
Referenced  by  generate_word_  and  hyphenate^  and 
called  by  random_word_. 

random_unit_stat_  Assembly  language  program  containing  definitions 

of  external  static  variables  used  by  random_unit 


random_word_ 

Subroutine  implementing  the  word  generator. 
Called  by  generate_word_  and  hyphenate^. 

read_table_ 

Subroutine  that  compiles  the  digram  table.  This 
is  an  internal  interface  only  called  by 
digram_table_compiler. 
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COMPILATION  LISTING  OF  SEGMENT  columns 

Compiled  by:  Experimental  PL/I  Compiler  of  Tuesday,  March  25,  1975  at  14:19 
Compiled  on:  04/01/75  1720.7  edt  Tue 

Options:  check  source 
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54  del  string  chart  132)  varying; 

55  del  real_length  fixed  bin; 
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if  substrClines,  nchars,  1)  “*  newline  /•  This  may  cause  problems  if  not  checked  •/ 
tben  call  ugly  (0,  "Last  line  in  segment  does  not  end  in  new  line  character"); 


we  expect  that  most  lines  won't  contain  tabs  in  them,  therefore 

the  code  below  avoids  calline  tab_expander  if  no  tabs  are  in  the  line. 
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/•  insert  formfeed  ■/ 
266  nn  = 

269  end; 

270  end; 
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320  ugly:  proc  (code,  message); 

321  del  code  fixed  bin ( 35 ) ; 

322  del  message  char(f); 

323  call  com_err_  (code,  "columns",  message); 

324  goto  terminate;  /*  nonlocal  goto  to  finish  up  */ 
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COMPILATION  LISTING  OF  SEGMENT  convert_word_char_ 

Compiled  by:  Experimental  PL/I  Compiler  of  Tuesday,  March  25,  1975  at  14:19 
Compiled  on:  04/01/75  1721.1  edt  Tue 

Options:  check  source 
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COMPILATION  LISTING  OF  SEGMENT  digram_table_compiler 

Compiled  by:  Experimental  PL/I  Compiler  of  Tuesday,  March  25,  1975  at  14:19 
Compiled  on:  04/01/75  1721.1  edt  Tue 

Options:  check  source 
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4 del  digrams$digrams  external; 

5 del  digraas$n_unlt3  fixed  bin  external; 

6 del  digrams$letters  external; 

7 del  digraaslrules  external; 
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121 

122  if  "compile  then  goto  dont_compile; 
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then  call  ioa_$nnl(copy(w“/"  , 66-first. 
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COMPILATION  LISTING  OF  SEGMENT  generate_word_ 

Compiled  by:  Experimental  PL/I  Compiler  of  Tuesday,  March  25,  1975  at  U:19 
Compiled  on:  0V01/75  1721.2  edt  Tue 

Options:  check  source 
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52  /*  This  entry  allows  the  user  to  set  the  seed.  If  the  seed  argument  is  zero,  we 

53  go  back  to  using  the  clock  value. 


56  del  teed  fixed  bin(35); 
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1 generate_worda:  gw:  procedure; 

2 del  cu  $arg  ptr  entry  (fixed, ptr, fixed, fixed  bin(35)); 

3 del  eu  $arg  ptr. rel  entry  (fixed  bin,  ptr,  fixed  bin,  fixed  bin(35)»  ptr); 

4 del  eu_$arg_list_ptr  entry  (ptr); 

5 del  argno  fixed; 
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52  line_length:  procedure  returns  {char(f)); 

53  del  ios_$ changemode  entry  (char(*)#  char{*)#  char(#)f  bit(72)  aligned); 
5*»  del  status  bit(72)  aligned; 

55  del  1 expanded_status  based  {addr(status) ) t 


56  2 bits  bit(36), 

57  2 status_code  fixed  bin(35); 

58  del  numbers_index  fixed  bln; 

59  del  ll_index  fixed  bin; 
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COMPILATION  LISTING  OF  SEGMENT  hyphen_test 

Compiled  by:  Experimental  PL/I  Compiler  of  Tuesday,  March  25,  1975  at  14:19 
Compiled  on:  04/01/75  1721.5  edt  Tue 

Options:  check  source 
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COMPILATION  LISTING  OF  SEGMENT  hyphenate_ 

Compiled  by:  Experimental  PL/I  Compiler  of  Tuesday,  March  25,  1975  at  14:19 
Compiled  on:  04/01/75  1721.5  edt  Tue 

Options:  check  source 
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60  del  (unitjprobabili ties  based  (u_p_ptr),  vowe Improbabilities  based  (v_p_ptr))  dim  (n_unlts)  float  bin; 
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64  spllt_point  s 0; 

65  calculate  = "O^b;  /*  we  aren't  calculating  probabilities,  Just  hyphenating  */ 

66  goto  continue; 
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/*  These  probabilities  are  calculated  merely  by  adding  up  the  number  of 
occurances  of  each  of  the  unit  indexes  in  the  numbers  array  and  the 
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35  del  1 rules(n_unita)  aligned  baaed  (addr( digramajrulea) ) , 
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18  /•  PROCEDURE  GENERATEJJNIT  V 
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call  ioa_$nnl("100  tries  failed  to  generate  unit.*/  password  so  far  is: 
do  i = 1 to  index; 

call  ioa_$nnl("*a" , letters(pa3sword(i) ) ) ; 


«“»  c xz 


H £ 


•-  -i  L -a  c 
•a  -i  — c t, 

C It  T5  H O 3 

4)  CJ  C «-•  O *->  1 


O C •»  C 
>,  3 
I -> 

E E -I  C 

-t  o o c *> 

d 3 tj  3 r 

S C C w 

o m nj  n 

> C C X>  O 


| L C V O 

a 4)  n o 

o>  xz  < — < 4>  4 

4)  4-1  4)  ••?}  , 


2 .. 
n 4) 
• L 

« 


« 4->  T3 
-H  *-> 

0)  4J 

n «~i  n 


4J  K K • 

| — I -t  * 

*>  -X  4-.  4_  T>  nJ 

O flJ  4)  4-  C 4> 


2^ 


i 5 x I 

x now 

> o n V L 

) T3  «1  4J 

C & C *3 
i ^ C 4>  *j 

. C 4-1  XZ  O 

4)  4-  £ -(  .J  ^ 

* XZ  4i  4-i  £ 

I *J  4-  XZ 


X)  fl)  > 
ICC. 
o c 

n o 4J  *j 

• c 4)  +j  <n 

a l 

x:  « -h 

U II  .4-! 

I —'■on 

. II  X C 4J 

4)  O 4 . 

n -o  o a 

C C 4)  C 


■o  ••  • 

c -a  -> 

o c -a 

o 4;  c 

4;  - o 

n — ' o 

► TJ  4J 

4)  o *j  c n 

5 4)  n O - 

o w *“  4)  n 

• *j  4-  n c 

— * n w - —i 


-Sg 


n m<m  io 

S t 

- ra  TJ  « DC 

c e «h 
u 4 id  -a 
— t c 
■o  C M II 

II  5 -O 

4)  C 

5 “°|  " *| 

m *j  -a 

4)  O C O 


C 4J  X>  C 


3 0 

x a.  n o x> 

n • • 4> 

. -v  n a> 

— ' tj  -a  - a 

■OCC4JS- 

c o o n o 

O O O t-  4-. 

y 4)  ii  -h 

4>  n n 4-.  » 

n - -w  v. 

- *J  •*->  CO 

*->  n n e 

n c c re 

4 -I  *H  L f\l 

-H  4_  4_  tX 

4-.  w v-.  -I  /\ 

w n n -a 

B rt  § ii  5 

Sec  tc 

t U L C 

bC  -H  — H —“I  i) 

— i -o  -a  n — i 

■o  a 

II  II  | 4) 
II  r-4  H 

X X fl]  XI 

J<  -I  -I  M 10 

(0  4.  e 4)  H 

4)  4)  4-.  —I  —I 

e e o — * >. 

xi  a n -i  n 


4->  4)  «4  £ H 


tox>h-oocr>o  «-<M  oo  3 ixivXJt^ooONO  — (M  oo  3 ir\'£>t'-oo  O'O  — f\i  rn  =r  lo  x y-  oo  <x>  O — <M  ipo  r--  oo  o\  o «-  fo  uo  xi 

E — E — E — t — t — OOCOCOCOCOCDCOOOCOOO(T'(7'(7'(7'l3><T><J'a'a'(7'0000000000^  — — — *—  — — — r~  r~  <\J  <M  (\l  <\i  <N  <\l  <M 


97 


227  then  /•  vowel  and  this  unit  is  not  a vowel  ■/ 

228  if  digrans( password( index-2) , first) .suffix 

229  then 

230  if  *v  then  break  = "1"b;  /*(if  last  pair  was  not_end,  new_unit  gave  us  a vowel)*/ 


for  the  last  n units  in  this  syllable,  to  be  interpreted  as  follows: 
v stands  for  a vowel  (including  alternate_vowel) 
c stands  for  a consonant 
x stands  for  any  unit 
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random_word_. list  0U/01/75  1729-^  edt  Tue  pag* 
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448  else 

449  if  unit  < 0 

450  then  goto  accept ed_but_keep_trying; 

451  else  si  = 0; 
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s s 
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iled  by:  Experimental  PL/I  Compiler  of  Tuesday, 
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10  del  (i,  j,  k,  1)  fixed  bin; 

11  del  errf lag  bit(1)  init("0"b); 

12  del  fata  l^f lag  bi t(  1 ) init<"0»b); 
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read_table_. list  04/01/75  1729.5  edt  Tue  page 
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31*  /•  check  character  following  for  comma,  new_line,  or  semicolon  */ 
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91  call  addline  ("equ  n,"  •!  substr(character(i-l ) , verify(chiracter (i-1 ) , " ")));  /•  first  line  of  ALH  program  */ 
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char  s substr( source_table, loc, 1 ) ; 


read_table__. list  04/01/75  1729.5  edt  Tue  page 
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call  ioa_$nnl( "alpha  character  expected") 
goto  err; 


.table_.  list  04/01/75  1729.5  edt  Tue  page 
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call  tera  >3Cg  ptr  (30urce_table_ptr,  code); 
return(errflag) ; 
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call  ioa_$nnl(" mmm  IS  letters_split(i) .second  II  """  expected1 
goto  err; 


419  /•  Add  a line  to  ALM  program  •/ 

420 

421  addline:  proc  (string); 

422  del  string  char (30)  varying; 
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APPENDIX  IV 


2000  RANDOM  WORDS 


The  2000  random  words  listed  on  the  following  pages  were  genera 
ted  in  one  particular  sample  run  using  the  tables  described  in  Appen 
dix  I.  See  page  18  for  a description  of  this  listing. 
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2000  RANDOM  WORDS 


acbra 

ac-bra 

anpedavi 

an-pe-da-vi 

baiddbyt 

baidd-byt 

accarsju 

ac-cars-ju 

anviv 

an-viv 

bajoo 

ba-joo 

acmico 

ac-mi-co 

anwobaj 

an-wo-baj 

balhayo 

bal -ha-yo 

acnaw 

ac-naw 

apcloy 

ap-cloy 

baliom 

ba-li-om 

adakgem 

a-dak-gem 

apdrase 

ap-drase 

baquon 

ba-quon 

addazov 

ad-da-zov 

apkudaci 

ap-ku-da-ci 

basclwa 

bas-ci-wa 

addwus 

add-wus 

apnopku 

ap-nop-ku 

b as fag 

bas-fag 

adeocro 

a-de-oc-ro 

apwry 

ap-wry 

becamnob 

be-cam-nob 

adfarvra 

ad-farv-ra 

araco 

a-ra-co 

becgroha 

bec-gro-ha 

adfrobga 

ad-frob-ga 

arcmawmo 

arc -maw -mo 

beckreo 

bec-kre-o 

adicoc 

a-di-coc 

arego 

a-re-go 

becwyd 

bec-wyd 

adkrev 

ad-krev 

arjhoi 

ar j-hoi 

bedfleey 

bed-fleey 

adlisa 

ad-li-sa 

armvodru 

arm-vo-dru 

bedibhi 

be-dib-hi 

adoif 

a-doif 

arobli 

a-ro-bli 

bejeg 

be-jeg 

adtemruf 

ad-tem-ruf 

arsshu 

ars-shu 

benchdyn 

bench-dyn 

advuj 

ad-vuj 

aseld 

a-seld 

beofy 

be-of-y 

afttwir 

aft-twir 

aseyjaha 

a-sey-ja-ha 

beokyo 

be-ok-yo 

agniji 

ag-ni-ji 

ashfa 

ash-fa 

berho 

ber-ho 

agromjax 

a-grom-jax 

ashuwirp 

as-hu-wirp 

berhyveu 

ber-hy-veu 

agrovca 

a-grov-ca 

asuwoj 

a-su-woj 

betavi 

be-ta-vi 

aiboc 

ai-boc 

ateunnga 

a-teunn-ga 

betwey 

be-twey 

aicboaj 

aic-boaj 

athoigna 

a-thoig-na 

bevsnudd 

bev-snudd 

aijsav 

aij-sav 

atkopyej 

at-ko-pyej 

bevtu 

bev-tu 

aint jee 

aint-jee 

atojshyn 

a-toj-shyn 

bladbeng 

bi-ad-beng 

ajdama 

aj-da-ma 

atshub 

atsh-ub 

bibjav 

bib-jav 

ajhery 

aj -he-ry 

atwej 

a-twej 

binchrod 

binch-rod 

ajkealv 

aj-kealv 

auhuva 

au-hu-va 

biphs 

biphs 

aj lytwa 

aj-ly-twa 

aupkahy 

aup-ka-hy 

bipku 

bip-ku 

ajnek 

aj-nek 

auptcu 

aupt-cu 

bippu 

bip-pu 

akdil 

ak-dil 

avavy 

a-vav-y 

bivics 

bi-vics 

akhec 

ak-hec 

avcarmev 

av-car-mev 

biyebryg 

bi-ye-bryg 

akhlbwos 

ak-hib-wos 

avriss 

av-riss 

biyus 

bi-yus 

akjaruj 

ak-ja-ruj 

avthwy 

av-thwy 

blaidcej 

blaid-cej 

akklokto 

ak-kl ok-to 

avthyve 

av-thyve 

bleahiya 

blea-hi-ya 

akprujo 

ak-pru-jo 

avutman 

a-vut-man 

blipjove 

blip-jove 

aktadssu 

ak-tads-su 

awwecba 

aw-wec-ba 

blitfefe 

blit-fefe 

alcho 

al-cho 

aycleti 

ay-cle-ti 

blofe 

blofe 

algofwee 

al-gof-wee 

ayjedsi 

ay-jeds-i 

blyi jnee 

bly-ij-nee 

alltomp 

all-tomp 

ay  low 

ay -low 

blyvabs 

bly-vabs 

altkeye 

alt-keye 

aymsfop 

ayms-fop 

bocks 

bocks 

alwa  f i 

al-wa-f i 

ayootta 

a-yoot-ta 

bocwa 

boc-wa 

ambrigno 

am-brig-no 

aypvihy 

ayp-vi-hy 

bodjobli 

bod-jo-bli 

amshy 

am-shy 

aysig 

ay-sig 

bogcet 

bog-cet 

amvacs 

am-vacs 

babfelby 

bab-f el-by 

bogvuswy 

bog-vu-swy 

amvuti 

am-vu-ti 

bacgebvo 

bac-geb-vo 

bojiri 

bo-ji-ri 

anafniv 

a-naf-niv 

bafdacga 

baf-dac-ga 

booval 

boo-val 
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(continued) 


boshtpel 

bosht-pel 

ceas jota 

ceas-jo-ta 

ciscreny 

cis-cren-y 

boudcof 

boud-cof 

ceays 

ceays 

civbybab 

civ-by-bab 

bowyt 

bo-wyt 

ceehat 

cee-hat 

civ jece 

civ-jece 

brad cur 

brad-cur 

ceekasom 

cee-ka-sora 

cixdo 

cix-do 

brefep 

bre-fep 

ceemm 

ceemm 

clefno 

clef-no 

brerthy 

brer-thy 

ceethpun 

ceeth-pun 

clehy 

cle-hy 

brighe 

brighe 

ceetvif 

ceet-vif 

clislo 

cli-slo 

brikaw 

bri-kaw 

cegmowec 

ceg-mo-wec 

cliwa 

cli-wa 

broctbar 

broct-bar 

cegpu 

ceg-pu 

clorgcy 

clorg-cy 

brolwoi 

brol-woi 

cegvu 

ceg-vu 

coasbebi 

coas-be-bi 

bronjept 

bron-jept 

ceigi 

cei-gi 

cocuevo 

co-cue-vo 

bruosta 

bru-os-ta 

ceindhy 

ceind-hy 

codfri 

cod-fri 

bruvlufe 

bruv-lufe 

ce jiogmo 

ce-ji-og-mo 

coftti 

cof t-ti 

bryarne 

bryarne 

cem jatbu 

cem-jat-bu 

comtdoa 

comt-doa 

bryffoj 

bryf-fo j 

cenjo 

cen-jo 

conclay 

con-clay 

bryse 

bryse 

cenved 

cen-ved 

conmeco 

con-me-co 

bucushu 

bu-cu-shu 

ceoliho 

ce-o-li-ho 

copsgha 

cops-gha 

buebwu 

bueb-wu 

ceowf 

ce-owf 

copuac 

co-pu-ac 

bussdene 

buss-dene 

cerba 

cer-ba 

corlibbu 

cor-lib-bu 

butoc 

bu-toc 

cestha 

ces-tha 

coshryg 

co-shryg 

buyen 

bu-yen 

ceugcho 

ceug-cho 

cotkni 

cot-kni 

buyovna 

bu-yov-na 

ceuhau 

ceu-hau 

cottrufa 

cott-ru-fa 

bybguf 

byb-guf 

cewf  u 

cew-fu 

couhile 

cou-hile 

bybhihyp 

byb-hi-hyp 

cewphjo 

cewph-jo 

couwukfu 

cou-wuk-fu 

bycij 

by-cij 

cezcy 

cez-cy 

coysfo 

coys-fo 

byhojoc 

by-ho-joc 

chaf ja 

chaf-ja 

cozrer 

coz-rer 

byipio 

by-i-pi-o 

chaheog 

cha-he-og 

crafnec 

craf-nec 

by joarsy 

by-joar-sy 

chatmum 

chat-mum 

craktamu 

crak-ta-mu 

bykmeol 

byk-me-ol 

chetki 

chet-ki 

crasco 

cras-co 

bylij 

by-lij 

chewchut 

chew-chut 

cravpo 

crav-po 

bynri 

byn-ri 

chimwed 

chim-wed 

ere jorry 

cre-jor-ry 

bypvee 

byp-vee 

chishrai 

chi-shrai 

crelerhi 

cre-ler-hi 

byruc 

by-ruc 

chivi 

chi-vi 

crer ju 

crer-ju 

bysstoct 

bys-stoct 

choasdy 

choas-dy 

cribmac 

crib-mac 

bytatha 

by-ta-tha 

chrubf u 

chrub-fu 

crova 

cro-va 

cacky 

cack-y 

chuspryn 

chusp-ryn 

cru japki 

cru-jap-ki 

caged 

ca-ged 

chysnior 

chys-ni-or 

crybtoi 

cryb-toi 

caibo 

cai-bo 

ciaki 

ci-a-ki 

cryce 

cryce 

cajthett 

caj-thett 

cibbihi 

cib-bi-hi 

cryjawl 

cry-jawl 

calkny 

cal-kny 

cickka 

cick-ka 

cubyuct 

cu-byuct 

carurla 

ca-rur-la 

cienbafi 

cien-ba-fi 

cucksja 

cucks-ja 

catseco 

cats-e-co 

cifdabgi 

cif-dab-gi 

cucsmu 

cucs-mu 

cauge wd 

cau-gewd 

cigzawm 

cig-zawm 

cuitreb 

cu-it-reb 

cawete 

ca-wete 

cijpuyon 

cij-pu-yon 

cu jmemy 

cuj-me-my 

cawlgu 

cawl-gu 

cilch 

cilch 

cuoywri 

cu-oy-wri 

ceakhen 

ceak-hen 

cingloo 

cin-gloo 

cupco 

cup-co 
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curirai 

cu-ri-rai 

decreab 

dec-reab 

dodnirwa 

dod-nir-wa 

cuwaso 

cu-wa-so 

dedycea 

de-dy-cea 

doglin 

do-glin 

cuyisguc 

cu-yis-guc 

deerb 

deerb 

dogsjugs 

dogs-jugs 

cuyovvif 

cu-yov-vif 

defusk 

de-fusk 

dokabe 

do-kabe 

cyayo 

cya-yo 

deipie 

dei-pie 

dokmo 

dok-mo 

eye e eke o 

cy-ceck-e-o 

dejev 

de-jev 

domoo 

do-moo 

cyciacki 

cy-ci-ac-ki 

delro 

del-ro 

donond 

do-nond 

cyeemto 

cyeem-to 

depcans 

dep-cans 

doslyip 

do-sly-ip 

cyi tma 

cy-it-ma 

derjogfo 

der-jog-fo 

dotlu 

dot-lu 

cykku 

cyk-ku 

desgaku 

des-ga-ku 

dotog 

do-tog 

cylow 

cy-low 

deshleon 

de-shle-on 

doukdywo 

douk-dy-wo 

cylydga 

cy-lyd-ga 

detha 

de-tha 

dowigoco 

do -wi -go -co 

cymri 

cym-ri 

dethmewg 

deth-mewg 

dowvays 

dow-vays 

cymshday 

cymsh-day 

detmek 

det-mek 

doynn 

doynn 

cyngai 

cyn-gai 

detsbo 

dets-bo 

draja 

dra-ja 

cyoath 

cyoath 

detwynd 

de-twynd 

dralop 

dra-lop 

cyocyaf 

cyo-cyaf 

dexba 

dex-ba 

dreje 

dreje 

cyppio 

cyp-pi-o 

dicte 

dicte 

dremra 

drem-ra 

cysofbi 

cy-sof-bi 

diddy 

did-dy 

dreoms 

dre-oms 

cytgipe 

cyt-gipe 

dieje 

dieje 

dresyji 

dre-sy-ji 

cyvue j 

cy-vue j 

diethi 

die-thi 

drietoi 

drie-toi 

cywrof t 

cy-wroft 

dieyedi 

die-ye-di 

drite 

drite 

dabowt 

da-bowt 

diffos 

dif-fos 

drobveje 

drob-veje 

dacoafa 

da-coa-fa 

digba 

dig-ba 

droeja 

droc-ja 

dacoryun 

da-co-ryun 

digisubs 

di-gi-subs 

drodvove 

drod-vove 

da  era 

dac-ra 

digwy 

dig-wy 

droiro 

droi-ro 

dadpawa 

dad-pa-wa 

dihixag 

di-hix-ag 

droswen 

dro-swen 

dafekoab 

da-fe-koab 

dihyswif 

di-hy-swif 

druaney 

dru-a-ney 

dagby 

dag-by 

di jdaisk 

dij-daisk 

druche 

druche 

daheeno 

da-hee-no 

dijeyli 

di-jey-li 

drykucco 

dry-kuc-co 

daiyino 

dai-yi-no 

dijhejpo 

di j-hej-po 

dryoporc 

dryo-porc 

dakfi 

dak-fi 

dikruind 

dik-ru-ind 

duetjiso 

duet-ji-so 

darad 

da-rad 

dilgwi 

dilg-wi 

duibda 

du-ib-da 

darrgage 

darr-gage 

dindna 

dind-na 

duneoy 

du-ne-oy 

dashon 

da-shon 

dirssa 

dirs-sa 

dunzeshi 

dun-ze-shi 

da  shy 

da-shy 

dithwuic 

dith-wu-ic 

durrdo 

durr-do 

da syce 

da-syce 

ditvayps 

dit-vayps 

dyackdy 

dyack-dy 

dasypfea 

da-syp-fea 

diufo 

di-u-fo 

dydfro 

dyd-fro 

datlyka 

dat-ly-ka 

diumi 

di-u-mi 

dy  few 

dy-few 

datrouwa 

dat-rou-wa 

diutcic 

di-ut-cic 

dyfow 

dy-fow 

davpy 

dav-py 

divka 

div-ka 

dyklys 

dyk-lys 

dawdi 

dawd-i 

diwroxa 

di-wrox-a 

dykso 

dyks-o 

dawjokhu 

daw-jok-hu 

diyondy 

di-yon-dy 

dynuci 

dy-nu-ci 

dayru 

day-ru 

doacrie 

doac-rie 

dyteo 

dyt-co 

deas fis 

deas-fis 

dobheby 

dob-he-by 

dytsner 

dyts-ner 

debkav 

deb-kav 

docta 

doc-ta 

dywra 

dy-wra 
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dywud 

dy-wud 

efwupdan 

ef-wup-dan 

ethkef f 

eth-kef f 

eabcet 

eab-cet 

egalo 

e-ga-lo 

ethukkli 

e-thuk-kli 

eagodco 

ea-god-co 

egcachy 

eg-ca-ehy 

etnoween 

et-no-ween 

ea  jco 

ea j-co 

egidgema 

e-gid-ge-ma 

etosi 

e-to-si 

ealgado 

eal-ga-do 

egoute 

e-goute 

etsva 

ets-va 

eapbetga 

eap-bet-ga 

egumsh 

e-gurash 

eubgons 

eub-gons 

eatvafea 

eat-va-fea 

egyigtuf 

eg-yig-tuf 

euckus 

euck-us 

eaybfoa 

eayb-foa 

eibeu 

ei-beu 

eudapgol 

eu-dap-gol 

ebeogi 

e-be-o-gi 

eikbra 

eik-bra 

euf  zo 

euf-zo 

ebjiab 

eb-ji-ab 

eippu 

eip-pu 

eurcea 

eur-cea 

eblis 

e-blis 

ejcodfej 

ej-cod-fej 

evauya 

e-vau-ya 

ebnat 

eb-nat 

ejeowg 

e-je-owg 

evbue 

ev-bue 

ebveo 

eb-ve-o 

ejkehib 

e j-ke-hib 

evcliho 

ev-cli-ho 

eceink 

e-ceink 

e jtiarr 

ej-ti-arr 

evcrof 

ev-crof 

ecjasvu 

ec-jas-vu 

ekcee 

ek-cee 

eviradelm 

e-vim-delm 

ecka jhyn 

eck-aj-hyn 

ekdedva 

ek-ded-va 

evluhek 

ev-lu-hek 

ecmitt 

ec-mi tt 

ekdeu 

ek-deu 

evrantan 

ev-ran-tan 

ecnajo 

ec-na-jo 

ekeroa 

e-ke-roa 

evsarro 

ev-sar-ro 

ecouvda 

e-couv-da 

ekjapriu 

ek-jap-ri-u 

evurf  swa 

e-vurf-swa 

ecsfipe 

ecs-f ipe 

elcyvo 

el-cy-vo 

evuvi 

e-vu-vi 

ecuna 

e-cu-na 

elipiecy 

e-li-pie-oy 

evvof 

ev-vof 

ecvansh 

ec-vansh 

elo jchod 

e-loj-chod 

ewbknepu 

ewb-kne-pu 

ecywa 

e-cy-wa 

elolveag 

e-lol-veag 

ewdkaph 

ewd-kaph 

edcou j 

ed-couj 

emdrepe 

em-drepe 

ewecy 

e-we-cy 

eddyhi 

ed-dy-hi 

eratid 

era-tid 

ewisanyu 

e-wi-san-yu 

edfevu 

ed-f  e-vu 

encypi 

en-cy-pi 

ewreckab 

e-wreck-ab 

edgoj 

ed-goj 

enddy 

end-dy 

ewskaye 

ews-kaye 

edhucaw 

ed-hu-caw 

enkpalt 

enk-palt 

eybemi 

ey-be-mi 

edmuir 

ed-mu-ir 

envo  j 

en-vo j 

eycust 

ey-cust 

edonoi 

e-do-noi 

enyew 

en-yew 

eykeosfu 

ey-ke-os-f u 

edweep 

ed-weep 

eojmyg 

e-oj-myg 

eysba 

eys-ba 

eedeky 

ee-dek-y 

eonco 

e-on-co 

eytwi 

ey-twi 

eedneka 

eed-ne-ka 

eop jow 

e-op-jow 

ezkowhu 

ez-kow-hu 

eedpo 

eed-po 

epmiga 

ep-mi-ga 

facjacjo 

fac-jac-jo 

eehow 

ee-how 

epodmi 

e-pod-rai 

fafyevby 

faf-yev-by 

eejor 

ee-jor 

epoldto 

e-pold-to 

f ahawd 

fa-hawd 

eekvusu 

eek-vu-su 

epphew 

ep-phew 

fahemfai 

fa-hera-fai 

eemju 

eera-ju 

erfki 

erf-ki 

faifraeef 

faif-raeef 

eemyscle 

ee-myscle 

ersunaj 

er-su-naj 

f akpluer 

fak-pluer 

eengo 

een-go 

ertaho 

er-ta-ho 

falka 

fal-ka 

eepous 

ee-pous 

eruri 

e-ru-ri 

f alryds 

fal-ryds 

eeswygs 

ee-swygs 

eryesko 

e-ryes-ko 

f ariribs 

fa-ri-ribs 

eetrazeb 

eet-ra-zeb 

eshlazy 

e-shla-zy 

farul 

fa-rul 

efonry 

e-fon-ry 

eshro 

e-shro 

fathba 

fath-ba 

efrud 

e-frud 

esroyeba 

es-ro-ye-ba 

f awn eg 

f aw-neg 

eftha 

eft-ha 

essocove 

es-so-cove 

fecdruba 

fec-dru-ba 
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fedtyo 

fed-tyo 

fralav 

fra-lav 

ge okays 

ge-o-kays 

feent 

feent 

frepond 

fre-pond 

gesfi 

ges-fi 

fehukemo 

fe-hu-ke-mo 

frertci 

frert-ci 

getito 

ge-ti-to 

feike 

feike 

frerwru 

frer-wru 

geunejka 

geu-nej-ka 

fejwi 

fej-wi 

freyt 

f reyt 

gev jarsk 

gev-jarsk 

fe  jwo 

fej-wo 

fridetyp 

fri-de-typ 

ghassy 

ghas-sy 

fekblyba 

fek-bly-ba 

frivik 

fri-vik 

ghawndik 

ghawn-dik 

femliurp 

fem-li-urp 

froadwix 

froad-wix 

ghebgeb 

gheb-geb 

fenbawjo 

fen-baw-jo 

froki  jo 

fro-ki-jo 

ghecmarn 

ghec-marn 

ferhyts 

fer-hyts 

fromi 

fro-mi 

gheebo 

ghee-bo 

feroda 

fe-ro-da 

fryayst 

fryayst 

ghefrap 

ghe-frap 

feushno 

feu-shno 

fryood 

fryood 

ghelim 

ghe-lim 

fewjoli 

few-jo-li 

frypli 

fryp-li 

ghibgeks 

ghib-geks 

feyel 

fe-yel 

f uand 

fu-and 

ghinwa 

ghin-wa 

feyso 

fey-so 

fucfamme 

fuc-famme 

ghits 

ghits 

ficfu 

fic-f u 

fuchroaw 

fuch-roaw 

ghormfu 

ghorm-fu 

fierb ju 

fierb-ju 

fuehuega 

fue-hue-ga 

giadya 

gi-a-dya 

fiewods 

fie-wods 

fugheyn 

f ug-heyn 

giewob 

gie-wob 

filhi 

fil-hi 

fugjijo 

fug-ji-jo 

gigblody 

gig-blo-dy 

filhi 

fil-hi 

fuhuol 

fu-hu-ol 

gilfyuev 

gilf-yuev 

filiwuth 

fi-li-wuth 

fuifew 

fu-i-f  ew 

gimlak 

gim-lak 

fisabthu 

fi-sab-thu 

fu jdeyvi 

f u j-dey-vi 

ginzu 

gin-zu 

fisdu 

fis-du 

funnga 

funn-ga 

gipdacna 

gip-dac-na 

fitbat 

fit-bat 

futho 

fu-tho 

gipquoi 

gip-quoi 

fithvajo 

fith-va-jo 

fuyet 

fu-yet 

gipromso 

gip-rom-so 

fitwif 

fi-twif 

gadma jbu 

gad-raaj-bu 

girtwu 

gir-twu 

fiufrase 

fi-u-frase 

gaibci 

gaib-ci 

giwarnn 

gi-warnn 

fleac 

fleac 

gairgy 

gairg-y 

glatt 

glatt 

flecky 

fleck-y 

gaisuvi 

gai-su-vi 

glecjibs 

glee-jibs 

fleudfu 

fleud-fu 

galdno 

gald-no 

gliafuj 

gli-a-fuj 

flywond 

fly-wond 

ganrym 

gan-rym 

glicy 

gli-cy 

foDdry 

fob-dry 

garive 

ga-rive 

glodim 

glo-dim 

fociogu 

fo-ci-o-gu 

gatidfo 

ga-tid-fo 

glojpyt 

gloj-pyt 

focvuso 

foc-vu-so 

gavfa 

gav-fa 

glyceilm 

gly-ceilm 

fof idry 

fo-fi-dry 

gavneg 

gav-neg 

glymkon 

glym-kon 

fofyind 

fof-yind 

geajdu 

geaj-du 

goatgope 

goat-gope 

fogfli 

fog-fli 

gecpoby 

gec-po-by 

gobpo 

gob-po 

foinlir 

foin-lir 

gedad 

ge-dad 

gocmi 

goc-mi 

foivni 

foiv-ni 

geddni 

gedd-ni 

gofya 

gof-ya 

fokcrif 

fok-crif 

gefocfef 

ge-foc-fef 

gogruoco 

go-gru-o-co 

fokwuv 

fok-wuv 

gefpup 

gef-pup 

goiduco 

goi-du-co 

fonwuwri 

fon-wu-wri 

geicnand 

geic-nand 

goiwop 

goi-wop 

fooyd 

fooyd 

gelikac 

ge-li-kae 

gojosody 

go-jo-so-dy 

fotha 

fot-ha 

geljelo 

gel-je-lo 

gomwu 

gom-wu 

fraca 

fra-ca 

gerada 

gem-da 

gonpacyu 

gon-pa-cyu 

fraja 

fra-ja 

genaho 

ge-na-ho 

goohungy 

goo-hung-y 

118 


(continued) 


gosha 

gos-ha 

hocfoyd 

hoc-foyd 

idikicu 

i-di-ki-cu 

gosieg 

go-sieg 

hocky 

hock-y 

idmybe 

id-mybe 

goyelts 

go-yelts 

hodvoi 

hod-voi 

idsidro 

ids-i-dro 

goyndnia 

goynd-ni-a 

hofcroy 

hof-croy 

idsod 

ids-od 

grabiwar 

gra-bi-war 

hogsh 

hogsh 

idtryted 

id-try-ted 

grafsgi 

grafs-gi 

hoheckni 

ho-heck-ni 

ifaihu 

i-fai-hu 

greccla 

grec-cla 

hoisu 

hoi-su 

ifanciry 

i-f  an-ci-ry 

grinfict 

grin-f ict 

hokdu 

hok-du 

ifcasihi 

if -ca-si-hi 

grodore 

gro-dore 

homeb 

ho-meb 

ifietkla 

i-fiet-kla 

grogeddo 

gro-ged-do 

hooll 

hooll 

ifretwye 

i-fre-twye 

grui je 

gru-i je 

hophli 

ho-phli 

ifrie 

i-frie 

grumura 

gru-mum 

hordeett 

hor-deett 

if t tan 

if t-tan 

grykpho 

gryk-pho 

howusilu 

ho-wu-si-lu 

ifwri 

if-wri 

grypvair 

gryp-vair 

hoycaisy 

hoy-cai-sy 

igekni 

i-ge-kni 

gulty 

gul-ty 

hucte 

hucte 

ig  jit 

ig-jit 

gurzijcy 

gur-zij-cy 

hupbiv 

hup-biv 

igmur 

ig-mur 

guskni 

gus-kni 

hycalo 

hy-ca-lo 

igviva 

ig-vi-va 

hadgha 

had-gha 

hygept 

by-gept 

igwaur 

ig-waur 

hadghoce 

had-ghoce 

hyjane 

hy-jane 

igyeithe 

ig-yeithe 

hafbaj 

haf-baj 

hyktovo 

hyk-to-vo 

i jnaldo 

i j-nal-do 

hajcy 

haj-cy 

hyteuka 

hy-teu-ka 

ijnarca 

i j-nar-ca 

hapdiff 

hap-di f f 

hyunasyg 

hyu-na-syg 

i jobi 

i-jo-bi 

hapfowdo 

hap-fowd-o 

hyvathad 

hy-vat-had 

i jvop 

ij-vop 

harisu 

ha-ri-su 

hyvock 

hy-vock 

ikibsa 

i-kib-sa 

haths 

haths 

ibdogin 

ib-do-gin 

iligmu 

i-lig-mu 

hat jec 

hat-jec 

ibiwo 

i-bi-wo 

ilrybna 

il-ryb-na 

hatse 

hats-e 

iblees 

i-blees 

ilt  jo 

ilt- jo 

heetad 

hee-tad 

ibolel 

i-bo-lel 

ilvnev 

ilv-nev 

heeycri 

heey-cri 

ibyat 

i-byat 

impfiuna 

imp-fi-u-na 

heilgheg 

heil-gheg 

ibyeliwi 

i-bye-li-wi 

inagwo 

i-nag-wo 

hemcying 

hem-cy-ing 

icbetcuk 

ic-bet-cuk 

incileo 

in-ci-le-o 

herewru 

he-re-wru 

icbryso 

ic-bry-so 

inkaif f 

in-kaif f 

hexwu 

hex-wu 

icde jcu 

ic-de j-cu 

inotho 

i-no-tho 

hicbesiv 

hic-be-siv 

icdesi 

ic-de-si 

inovinbo 

i-no-vin-bo 

hidf apo 

hid-fa-po 

ichdu 

ich-du 

inpak 

in-pak 

hieth 

hieth 

ichoad 

i-choad 

inthshor 

inth-shor 

hifdy 

hif-dy 

ichwrorl 

ich-wrorl 

iof iow 

i-o-fi-ow 

hi jnan 

hi j-nan 

iclelyd 

ic-le-lyd 

iof  ja 

i-of-ja 

hi jraja 

hi j-ra-ja 

icpeydrau 

ic-peyd-mu 

ipacnor 

i-pac-nor 

hirgu 

hir-gu 

icque j 

ic-que j 

ipcro 

ip-cro 

hirquav 

hir-quav 

icryp 

ic-ryp 

ipgha 

ip-gha 

hisibyru 

hi-si-by-ru 

icsdafo 

ics-da-fo 

ipgior 

ip-gi-or 

hispko 

hisp-ko 

icsroba 

ics-ro-ba 

iphgokvu 

iph-gok-vu 

hiuby 

hi-u-by 

icweacks 

ic-weacks 

iphreitu 

iph-rei-tu 

hoacved 

hoac-ved 

iddgo 

idd-go 

ipseg 

ips-eg 

hocbyn 

hoc-byn 

idignos 

i-dig-nos 

irfbick 

irf-bick 
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irinttuo 

i-rint-tu-o 

jejlu 

jej-lu 

jouwi 

jou-wi 

iseunk 

i-seunk 

jektryit 

jek-try-it 

jovcyk 

jov-cyk 

ished 

i-shed 

jekueccu 

je-kuec-cu 

jowbe 

jowbe 

isneept 

is-neept 

jelewnhi 

je-lewn-hi 

jubji 

jub-ji 

issfi 

iss-fi 

jemafry 

je-ma-fry 

jufdyle 

juf-dyle 

isshna 

is-shna 

jenbi 

jen-bi 

juluwi 

ju-lu-wi 

isszeva 

iss-ze-va 

jendd 

jendd 

jumcokdu 

jum-cok-du 

istcy 

ist-cy 

jenho 

jen-ho 

juonan 

ju-o-nan 

itagni 

i-tag-ni 

jeobhyf f 

je-ob-hyff 

juvreg 

juv-reg 

itban 

it-ban 

jerctri 

jerc-tri 

juxlet 

jux-let 

itevud 

i-te-vud 

jerhig 

jer-hig 

kabisesu 

ka-bi-se-su 

ithfoj 

i th-foj 

jeuriram 

jeu-rimra 

kahab juv 

ka-hab-juv 

ithgushe 

ith-gushe 

jewlys 

jew-lys 

kaisi 

kai-si 

itofwaba 

i-tof -wa-ba 

jeybsk 

jeybsk 

kakti 

kak-ti 

itsdeg 

i ts-deg 

jeycafa 

jey-ca-fa 

kavdo 

kav-do 

itsnes 

its-nes 

jeyees 

je-yees 

kavuk 

ka-vuk 

itthou 

i t-thou 

jiepcag 

jiep-cag 

kawuya 

ka-wu-ya 

ivbysu 

iv-by-su 

jifkledd 

jif-kledd 

keabcry 

keab-cry 

ivcrocwi 

iv-croc-wi 

jijnapcy 

ji j-nap-cy 

keatpana 

keat-pa-na 

iveit 

i-veit 

jikdeeb 

jik-deeb 

kebinde 

ke-binde 

ivhee 

iv-hee 

jimra 

jim-ra 

kecblen 

kec-blen 

ivmamts 

iv-mamts 

jindcro 

jind -ero 

kecca 

kec-ca 

ivpirlo 

iv-pir-lo 

jirsh 

jirsh 

keckiva 

kec-ki-va 

izoye 

i-zoye 

jiscasdi 

jis-eas-di 

kefwept 

kef-wept 

jabruxi 

ja-brux-i 

ji tvadec 

jit-va-dec 

keickdi 

keick-di 

jabvel 

jab-vel 

jivbo 

jiv-bo 

ke jbreraa 

ke j-bre-ma 

jadfithi 

jad-fit-hi 

jivroan 

jiv-roan 

ke jrenwa 

ke j-ren-wa 

jadsbeho 

jads-be-ho 

jivunuby 

ji-vu-nu-by 

kekuo 

ke-ku-o 

jagnaveo 

jag-na-ve-o 

jiwioc 

ji-wi-oc 

keruve 

ke-ruve 

jahablad 

ja-ha-blad 

jiwoabca 

ji-woab-ca 

kewvo 

kew-vo 

jahut 

ja-hut 

joafruo 

joa-fru-o 

keybaiya 

key-bai-ya 

jaiwywu 

jai-wy-wu 

jobci 

job-ei 

kiaka 

ki-a-ka 

jakpyto 

jak-py-to 

jocaforr 

jo-ea-forr 

kiaswusp 

ki-a-swusp 

jaumcef 

jaum-cef 

jocci 

joc-ci 

kicuir 

ki-cu-ir 

javdajbo 

jav-da j-bo 

jof  steiz 

jof  s-teiz 

kid jaf 

kid-jaf 

javrai 

jav-mi 

jojegdra 

jo-jeg-dra 

kidubdu 

ki-dub-du 

jayrobyn 

jay-ro-byn 

jolspe 

jolspe 

kifnewfa 

kif-new-f  a 

jeanos 

jea-nos 

jomofe 

jo-raof e 

kisga 

kis-ga 

jebtro 

jeb-tro 

jonho 

jon-ho 

kivan 

ki-van 

jedfotdy 

jed-fot-dy 

jonistsu 

jo-nists-u 

klagge 

klagge 

jeehiwa 

jee-hi-wa 

joparanu 

jo-para-nu 

klecerru 

kle-cer-ru 

jefoulu 

je-fou-lu 

jorka 

jor-ka 

klecwio 

klec-wi-o 

jeguape 

je-gu-ape 

jos tpla 

jost-pla 

klowdno 

klowd-no 

jehilkti 

je-hilk-ti 

jotafcli 

jo-taf-cli 

klure 

klure 

jeinri 

jein-ri 

jothsha 

joth-sha 

klyhibe 

kly-hibe 

jeinvo 

jein-vo 

joucyka 

jou-ey-ka 

knavduew 

knav-duew 
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knect 

knect 

lagmi jy 

lag-mij-y 

lyuwag 

lyu-wag 

knenouke 

kne-nouke 

la jtusu 

la j-tu-su 

lyvas 

ly-vas 

knesa 

kne-sa 

lalfe 

lalfe 

mabka 

mab-ka 

knif  ru 

kni-fru 

lamlu 

lam-lu 

mac jay 

mac-jay 

knige 

knige 

larre 

larre 

madudi 

ma-du-di 

knoacy 

knoa-cy 

larro 

lar-ro 

raakiv 

ma-kiv 

knoji ti 

kno-ji-ti 

larysk 

la-rysk 

malnu 

mal-nu 

knoke 

knoke 

lasawnt 

la-sawnt 

malzata 

mal-za-ta 

knuedcu 

knued-cu 

latalsvi 

la-tals-vi 

marvici 

mar-vi-ci 

knuwa 

knu-wa 

lathheow 

lath-he-ow 

maswo 

ma-swo 

knydglak 

knyd-glak 

lecpotdu 

lec-pot-du 

mavdodu 

mav-do-du 

knyings 

kny-ings 

leilve 

leilve 

mawfbeny 

mawf -ben-y 

knywo 

kny-wo 

lemgure 

lem-gure 

may rut 

may-rut 

kobvilwo 

kob-vil-wo 

lenhohay 

len-ho-hay 

meapafku 

mea-paf-ku 

kofeto 

ko-f e-to 

leoneth 

le-o-neth 

raedcahu 

med-ca-hu 

koinbi 

koin-bi 

lerface 

ler-face 

meeljilb 

meel-jilb 

kojrup 

koj-rup 

lesobkuo 

le-sob-ku-o 

mehuvo 

me-hu-vo 

kojya 

koj-ya 

levscyva 

lev-scy-va 

meise 

meise 

kokbrye 

kok-brye 

lezda 

lez-da 

mem jel 

mem-jel 

kokfuo 

kok-f u-o 

lezycrec 

le-zy-crec 

raemyn 

me-myn 

koleesce 

ko-leesce 

liada 

li-a-da 

mencu 

men-cu 

kolskja 

kolsk-ja 

lidnogya 

lid-nog-ya 

mepdyo 

mep-dyo 

kooche 

kooche 

lifgak 

lif-gak 

mepoohej 

me-poo-hej 

kophja 

koph-ja 

liloc 

li-loe 

metupjob 

me-tup-job 

kosujpa 

ko-suj-pa 

lisjo 

lis-jo 

mevmewif 

mev-rae-wif 

kotvads 

kot-vads 

liulm 

li-ulm 

mexcu 

mex-cu 

kovshya 

kov-shya 

liunawyd 

li-u-na-wyd 

miachaft 

mi-ac-haft 

koybji 

koyb-ji 

lofovu 

lo-fo-vu 

mif jas 

mif-jas 

koywo 

koy-wo 

logsjo 

logs-jo 

migshyje 

mig-shyje 

kraincy 

krain-cy 

lokosmin 

lo-kos-min 

raihuty 

mi-hu-ty 

krapiep 

kra-piep 

lolhuf 

lol-huf 

mikug 

mi-kug 

krarracu 

krar-ra-cu 

lophyo 

loph-yo 

minri 

min-ri 

krete 

krete 

lopnute 

lop-nute 

miosvia 

mi-os-vi-a 

krethuto 

kre-thu-to 

lorpcier 

lorp-cier 

misvago 

mis-va-go 

krivo 

kri-vo 

lorphohu 

lor-pho-hu 

miukba 

mi-uk-ba 

krochis 

kroc-his 

ludnu 

lud-nu 

mizpi 

miz-pi 

kromjo 

krom-jo 

lutatkie 

lu-tat-kie 

modheca 

mod-he-ca 

krovew 

kro-vew 

lyavdyg 

lyav-dyg 

moithi 

moi-thi 

kruof e 

kru-of e 

lycaff 

ly-caff 

mo  jmo 

mo  j-mo 

kryov 

kryov 

lycel 

ly-cel 

momil 

mo-mil 

kucvuko 

kuc-vu-ko 

lygefgi 

ly-gef-gi 

moothalp 

moo-thalp 

kugwog 

kug-wog 

ly jeehok 

ly-jee-hok 

mowopy 

mo-wo-py 

kulsh 

kulsh 

lykzybi 

lyk-zy-bi 

mufiho 

mu-f i-ho 

kumvicnu 

kum-vic-nu 

lyruvpi 

ly-ruv-pi 

muijcry 

mu-i j-cry 

kuvhy 

kuv-hy 

lythowa 

ly-tho-wa 

rauthoby 

mu-tho-by 

lagiu 

la-gi-u 

lytis 

ly-tis 

mutiplyk 

mu-tip-lyk 
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myhati 

my-ha-ti 

nemaserr 

ne-ma-serr 

nubhyev 

nub-hyev 

myhow 

my-how 

nenair 

ne-nair 

nuchso 

nuch-so 

myifeeg 

my-i-feeg 

nenomdu 

ne-nom-du 

nucwror 

nuc-wror 

mykna 

my-kna 

neokipi 

ne-o-ki-pi 

nuishfi 

nu-ish-fi 

myrofed 

my-ro-fed 

neppri 

nepp-ri 

nulnot 

nul-not 

mysgere 

mys-gere 

nerga 

ner-ga 

nuvnilt 

nuv-nilt 

mytetvif 

my-tet-vif 

neshghee 

nesh-ghee 

nuvocida 

nu-vo-ci-da 

mythov 

my-thov 

nesob 

ne-sob 

nuvoinzi 

nu-voin-zi 

nacgryo 

nac-gryo 

nethsknu 

neths-knu 

nuwynip 

nu-wy-nip 

nadasthu 

na-das-thu 

neudu 

neu-du 

oabfu 

oab-fu 

nad  vof 

nad-vof 

newexam 

ne-wex-am 

oadedglu 

oa-ded-glu 

nafba 

naf-ba 

newmlo 

newm-lo 

oakdydnu 

oak-dyd-nu 

nafcybcy 

naf -cyb-cy 

neyfedwy 

ney-fed-wy 

oatokhot 

oa-tok-hot 

nafohu 

na-fo-hu 

neywyep 

ney-wyep 

oawojion 

oa-wo-ji-on 

nagejtu 

na-gej-tu 

nicja 

nic-ja 

obsso 

obs-so 

naibfon 

naib-fon 

nidin 

ni-din 

obvipye 

ob-vi-pye 

nakro 

nak-ro 

nievdiv 

niev-div 

oceppdif 

o-cepp-dif 

nalblu 

nal-blu 

niezokyu 

nie-zok-yu 

ockprad 

ock-prad 

naldyd 

nal-dyd 

nijwyd 

nij-wyd 

ocnoy 

oc-noy 

narame 

namme 

nikinkyo 

ni-kink-yo 

octrye 

oct-rye 

napom 

na-pom 

nilmku 

nilm-ku 

octtin 

oct-tin 

nashiwra 

na-shi-wra 

niness 

ni-ness 

odcrye 

od-crye 

nastu 

nas-tu 

ninmy 

nin-my 

odcuka 

od-cu-ka 

nattva 

natt-va 

nipcu 

nip-cu 

odhece 

od-hece 

natynhim 

na-tyn-him 

nipha 

ni-pha 

odkos 

od-kos 

nauwaf 

nau-waf 

nishgryo 

nish-gryo 

odvewju 

od-vew-ju 

navfebda 

nav-feb-da 

nisna 

nis-na 

odwreec 

od-wreec 

nawflent 

naw-flent 

niwair 

ni-wair 

ofcai 

of-cai 

nayi tyha 

na-yi-ty-ha 

nocar 

no-car 

ofnaycy 

of-nay-cy 

naykarci 

nay-kar-ci 

nocci 

noc-ci 

ofnic 

of-nic 

necprya 

nec-prya 

noceete 

no-ceete 

of  way 

of-way 

necra 

nec-ra 

noclot 

noc-lot 

ofyad 

of-yad 

necucucu 

ne-cu-cu-cu 

nocoss 

no-coss 

ofyewn 

of-yewn 

neeredeu 

nee-re-deu 

nodyo 

no-dyo 

ogbra 

og-bra 

neergtec 

neerg-tec 

nof tha 

nof-tha 

ogfata 

og-fa-ta 

nefri 

ne-fri 

noisspa 

noiss-pa 

ogiwru 

o-gi-wru 

negagut 

ne-ga-gut 

nokroi 

nok-roi 

ogjipca 

og-jip-ca 

negip 

ne-gip 

nolsht 

nolsht 

oiboay 

oi-boay 

negot 

ne-got 

noogo 

noo-go 

oidvebs 

oid-vebs 

nehidpro 

ne-hid-pro 

nopykfa 

no-pyk-fa 

oighcabo 

oigh-ca-bo 

nehiru 

ne-hi-ru 

noudcen 

noud-cen 

oighir 

oig-hir 

neice 

neice 

nourycel 

nou-ry-cel 

oiranib 

oim-nib 

neipvan 

neip-van 

novcias 

nov-ci-as 

oimut 

oi-mut 

nelrie 

nel-rie 

noxna 

nox-na 

oisckuma 

oisck-u-ma 

nelshjer 

nelsh-jer 

noyce 

noyce 

oiskasa 

ois-ka-sa 

neltoag 

nel-toag 

noydu 

noy-du 

oitha 

oit-ha 

122 


(continued) 


oitques 

oit-ques 

otkria 

ot-kri-a 

pheaglo 

phea-glo 

oitredi 

oit-re-di 

otlif 

ot-lif 

pheapomp 

phea-pomp 

ojenty 

o-jen-ty 

otost 

o-tost 

phelti 

phel-ti 

ojheibu 

oj-hei-bu 

otvayd 

ot-vayd 

phijca 

phi j-ca 

ojikgi 

o-jik-gi 

otynro 

o-tyn-ro 

phlegva 

phleg-va 

ojkojdy 

oj-koj-dy 

oudti 

oud-ti 

phless 

phless 

ojojoga 

o-jo-jo-ga 

ourhogs 

our-hogs 

phlief 

phlief 

ojphov 

oj-phov 

ourri 

our-ri 

phubilst 

phu-bilst 

ojstro 

oj-stro 

oury jau 

ou-ry-jau 

phunogso 

phu-nogs-o 

ojugcry 

o-jug-cry 

ovcyfen 

ov-cy-f  en 

phuwo 

phu-wo 

oka  jo 

o-ka-jo 

ovglyt 

ov-glyt 

pictten 

pict-ten 

okcewyve 

ok-ce-wyve 

ovgroo 

ov-groo 

pijted 

pij-ted 

oksibo 

oks-i-bo 

ovitte 

o-vitte 

pingsvu 

pings-vu 

olactmam 

o-lact-mam 

ovo jetig 

o-vo-je-tig 

piyiffe 

pi-yiffe 

olahit 

o-la-hit 

ovriso 

ov-ri-so 

plaught 

plaught 

olshlely 

ol-shle-ly 

ovsmiri 

ov-srai-ri 

plecpu 

plec-pu 

olsli 

ol-sli 

ovteyo 

ov-te-yo 

pleett 

pleett 

olswodpa 

ol-swod-pa 

ovvuc 

ov-vuc 

plivu 

pli-vu 

ombomo 

om-bo-mo 

owjope 

ow-jope 

ployhat 

ploy-hat 

omreddwu 

om-redd-wu 

owpciovo 

owp-ci-o-vo 

plydahoi 

ply-da-hoi 

onbloth 

on-bloth 

oydjali 

oyd-ja-li 

pokjij 

pok-ji j 

onbro 

on-bro 

oyouvers 

o-you-vers 

pothef fi 

po-thef-f i 

ondlunya 

ond-lun-ya 

oyt jefi 

oyt-je-fi 

povboula 

pov-bou-la 

ondpliak 

ond-pli-ak 

ozuovgu 

o-zu-ov-gu 

pozcygt i 

poz-cyg-ti 

onoxe 

o-noxe 

padtra 

pad-tra 

pracfrob 

prac-frob 

onst ju 

onst-ju 

pailp 

pailp 

pratardo 

pra-tar-do 

ontdoi 

ont-doi 

pajles 

paj-les 

prorj 

prorj 

onuokto 

o-nu-ok-to 

pakyith 

pak-yith 

proucha 

prou-cha 

oocdet 

ooc-det 

pauvweu 

pauv-weu 

pryee 

pryee 

oocre 

oocre 

pawdedo 

pawd-e-do 

pryin 

pry-in 

oosnorgu 

oos-nor-gu 

pawubi 

pa-wu-bi 

pryitkek 

pry-it-kek 

ootdyhoa 

oot-dy-hoa 

paywu 

pay-wu 

pudaps 

pu-daps 

openk 

o-penk 

pedoghof 

pe-do-ghof 

puits 

pu-its 

ophthuig 

oph-thu-ig 

pedti 

ped-ti 

pujbiagy 

puj-bi-ag-y 

oprayti 

op-my-ti 

peetieg 

pee-tieg 

punipciv 

pu-nip-civ 

opyti 

o-py-ti 

pef  carr 

pef -carr 

purvmyoj 

purv-myoj 

orgami 

or-ga-mi 

pejvonga 

pej-von-ga 

pusadeta 

pu-sa-de-ta 

oroaba 

o-roa-ba 

pekcag 

pek-cag 

pyadvu 

pyad-vu 

orquej 

or-que j 

pesdray 

pes-dray 

pydaft 

py-daft 

ortta 

ort-ta 

peshegha 

pes-he-gha 

pydtejtu 

pyd-tej-tu 

osgroso 

os-gro-so 

pesshmu 

pes-shmu 

pyjukra 

py-juk-ra 

oshgegtu 

osh-geg-tu 

pexgeuxa 

pex-geux-a 

pypyoco 

py-pyo-co 

oswev 

o-swev 

phache 

phache 

pyrabraal 

py-rab-mal 

otgraf s 

ot-grafs 

phacwef 

phac-wef 

pyris 

py-ris 

othivdo 

o-thiv-do 

phagfi 

phag-fi 

pyuijhi 

pyu-ij-hi 

othnoak 

oth-noak 

phapvez 

phap-vez 

quagro 

qua-gro 
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quavu 

qua-vu 

reyja 

rey-ja 

roudu 

rou-du 

quaytha 

quay-tha 

rhicwo 

rhic-wo 

rovovu 

ro-vo-vu 

quedyed 

que-dyed 

rhiemt 

rhiemt 

ruadpi 

ru-ad-pi 

queske 

queske 

rhilke 

rhilke 

rugnur 

rug-nur 

quiapho 

qui-ap-ho 

rhoinko 

rhoin-ko 

ruici 

ru-i-ci 

quibs 

quibs 

rhynute 

rhy-nute 

ru  jno 

ruj-no 

rabpi 

rab-pi 

rialqua 

ri-al-qua 

rumcawt 

rum-cawt 

racrinn 

rac-rinn 

riasa 

ri-a-sa 

runkzi 

runk-zi 

radpildd 

rad-pildd 

riccea 

ric-cea 

ruppbedd 

rupp-bedd 

raihusav 

rai-hu-sav 

ricda 

ric-da 

rutwece 

ru-twece 

raimmp 

raimmp 

ricio 

ri-ci-o 

ruwabkid 

ru-wab-kid 

rainegha 

rai-ne-gha 

ricki 

rie-ki 

ruwik 

ru-wik 

raj  vow 

raj -vow 

riclea 

ric-lea 

ruyadpoy 

ru-yad-poy 

rakyac 

rak-yac 

ricnu 

ric-nu 

ruyucvi 

ru-yuc-vi 

ramdru 

ram-dru 

ricsi 

rics-i 

ryane 

ryane 

ranwye 

ran-wye 

ridatcee 

ri-dat-cee 

rychyg 

ry-chyg 

rarenbu 

ra-ren-bu 

rietvue 

riet-vue 

rycitwev 

ry-ci-twev 

rar jnu 

rar j-nu 

rijacas 

ri-ja-cas 

rycoabco 

ry-coab-co 

ratvuld 

rat-vuld 

ri jreci 

ri j-re-ci 

rycoye 

ry-coye 

ravef 

ra-vef 

ri jviki 

ri j-vi-ki 

rydilb 

ry-dilb 

ravek 

ra-vek 

rimavmaw 

ri-mav-maw 

ry  fladgi 

ry-flad-gi 

rawej 

ra-wej 

rinen 

ri-nen 

ryjobo 

ry-jo-bo 

rayceoc 

ray-ce-oc 

rinnphi 

rinn-phi 

ryjupo 

ry-ju-po 

rebhet 

reb-het 

riphha 

riph-ha 

rykty 

ryk-ty 

rebku 

reb-ku 

ripni 

rip-ni 

rylynswy 

ry-lyn-swy 

rebu joca 

re-bu-jo-ca 

ristrewp 

rist-rewp 

rymro 

rym-ro 

reddy 

red-dy 

ritas 

ri-tas 

rynadoge 

ry-na-doge 

redjeyon 

red-je-yon 

rivgipav 

riv-gi-pav 

ryouv 

ryouv 

reedbi 

reed-bi 

rivoam 

ri-voam 

ryt jior 

ryt-ji-or 

reedphot 

reed-phot 

rivva 

riv-va 

rytmodmu 

ryt -mod -mu 

reeygu 

reey-gu 

riwrida 

ri-wri-da 

ryuajir 

ryu-a-jir 

regcaib 

reg-caib 

riwyac 

ri-wyac 

ryuwa 

ryu-wa 

regsfa 

regs-fa 

roadswus 

roads-wus 

rywictfi 

ry-wict-fi 

reifiec 

rei-fieo 

roadtu 

road-tu 

sad jip 

sad-jip 

rejayoi 

re-ja-yoi 

rocwain 

roc-wain 

sadol 

sa-dol 

rejsnan 

re j-snan 

rodbla 

rod-bla 

sahoarvy 

sa-hoarv-y 

remwru 

rem-wru 

rokakny 

ro-ka-kny 

sahuv 

sa-huv 

renmo 

ren-mo 

rokkect 

rok-kect 

salnodu 

sal-no-du 

reproo 

rep-roo 

rolety 

ro-le-ty 

sanfo 

san-fo 

rerai j 

re-raij 

ronso 

ron-so 

sanomt 

sa-nomt 

rerjby 

rerj-by 

roocwu 

rooc-wu 

satnan 

sat-nan 

rerjneav 

rer j-neav 

roombrac 

room-brac 

savki 

sav-ki 

reshbuf 

resh-buf 

ropflaty 

rop-fla-ty 

sawtoyad 

saw-to-yad 

rethe 

rethe 

ror jam 

ror-jam 

saysbu 

says-bu 

revkrey 

rev-krey 

r os fat 

ros-fat 

sceeltfu 

sceelt-fu 

reyff 

reyff 

rothko 

roth-ko 

schesy 

sche-sy 
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sciobo 

sci-o-bo 

sorifri 

so-ri-fri 

tethnan 

teth-nan 

screa 

screa 

spabu 

spa-bu 

tevevi 

te-ve-vi 

scuocu 

scu-o-cu 

speitoto 

spei-to-to 

teybwu 

teyb-wu 

sedvude 

sed-vude 

spibsmu 

spibs-mu 

thasachi 

tha-sa-chi 

seif  rav 

seif-rav 

spipvigh 

spip-vigh 

thatbogs 

that-bogs 

seisap 

sei-sap 

spoi jtoc 

spoi j-toc 

thebaims 

the-baims 

sejtheb 

sej-theb 

spyfodd 

spy-fodd 

thefts 

thefts 

sewocle 

se-wocle 

spyun 

spyun 

theillpa 

theill-pa 

sheccu 

shec-cu 

stacso 

stacs-o 

thepcli 

thep-cli 

sheguogy 

she-gu-og-y 

stanod 

sta-nod 

thets 

thets 

shere 

shere 

stier 

stier 

thevu 

the-vu 

shikgevi 

shik-ge-vi 

strongha 

stron-gha 

thiarlfo 

thi-arl-fo 

shiwyt 

shi-wyt 

stuxa 

stux-a 

thojca 

thoj-ca 

shlabry 

shla-bry 

stycip 

sty-cip 

thona 

tho-na 

shmeithi 

shraeit-hi 

stylig 

sty-lig 

thradfi 

thrad-f i 

shnujdog 

shnuj-dog 

subfomi 

sub-fo-mi 

threzbo 

threz-bo 

shoawn 

shoawn 

swetknu j 

swet-knuj 

thridd 

thridd 

shrapdro 

shrap-dro 

swoghu 

swog-hu 

throaga 

throa-ga 

shtee 

shtee 

swoyfi 

swoy-fi 

throyott 

thro-yott 

shtiadeg 

shti-a-deg 

swur juss 

swur-juss 

thrup 

thrup 

shtild 

shtild 

swurny 

swurn-y 

thud  a 

thu-da 

shyib 

shy-ib 

syavo 

sya-vo 

thuisko 

thu-is-ko 

siasics 

si-a-sics 

sybdog 

syb-dog 

thwiwil 

thwi-wil 

sijty 

sij-ty 

syesjats 

syes-jats 

thyaym 

thyaym 

sklima 

skli-ma 

taccy 

tac-cy 

thylu 

thy-lu 

skrec 

skrec 

taceszy 

ta-ces-zy 

tiadd 

ti-add 

skyinid 

sky-raid 

taiset 

tai-set 

tibhay 

tib-hay 

slamfod 

slara-fod 

tajsciko 

taj-sci-ko 

tifdol 

tif-dol 

slasu 

sla-su 

taj  va 

taj-va 

tificbu 

ti-fic-bu 

slu jna 

slu j-na 

tajyaryb 

taj-ya-ryb 

tihadna 

ti-had-na 

slyhy 

sly-hy 

talmptwu 

talmpt-wu 

tilocoy 

ti-lo-coy 

slyos 

slyos 

tarorg 

ta-rorg 

tionu 

ti-o-nu 

slyry 

sly-ry 

tawsdi 

taws-di 

tiowark 

ti-o-wark 

slyshra 

sly-shra 

tebes 

te-bes 

tipgi 

tip-gi 

sraoyceko 

smoy-ce-ko 

tedoa 

te-doa 

tirpikdi 

tir-pik-di 

snemuews 

sne-muews 

teeji 

tee-ji 

titshcuk 

titsh-cuk 

snenbi 

snen-bi 

teerny 

teern-y 

titwes 

ti-twes 

snepvu 

snep-vu 

teivropa 

teiv-ro-pa 

tivned 

tiv-ned 

snoith 

snoith 

temblino 

tem-bli-no 

tiyifa 

ti-yi-fa 

snomwrul 

snom-wrul 

tenort 

te-nort 

toccuafi 

toc-cu-a-fi 

snotic 

sno-tic 

tenragwu 

ten-rag-wu 

tocon 

to-con 

snupdo 

snup-do 

teolub 

te-o-lub 

tokget 

tok-get 

snynju 

snyn-ju 

tequeaf 

te-queaf 

tokssu 

toks-su 

sodcy 

sod-cy 

terch 

terch 

tomfrys 

tom-frys 

sodnacti 

sod-nac-ti 

terir 

te-rir 

tosbi 

tos-bi 

sofbri 

sof-bri 

terynvoa 

te-ryn-voa 

trafciv 

traf-civ 
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trebo 

tre-bo 

udweryho 

ud-we-ry-ho 

vagyoo 

vag-yoo 

treormer 

tre-or-mer 

udygnedi 

u-dyg-ne-di 

va iwru 

vai-wru 

tretozwa 

tre-toz-wa 

ufdu jmod 

uf-duj-mod 

valco 

val-co 

tretrado 

tret-ra-do 

uf jeaki 

uf-jea-ki 

vardvu 

vard-vu 

trickacy 

trick-a-cy 

ufkodbot 

uf-kod-bot 

vargro 

var-gro 

trida 

tri-da 

ufwata 

uf-wa-ta 

vasce 

vasce 

trilmklu 

trilm-klu 

ugazsca 

u-gaz-sca 

vashcli 

vash-cli 

tripre 

tripre 

ugnea 

ug-nea 

vassvu 

vass-vu 

trodor 

tro-dor 

ugveje 

ug-veje 

veagfags 

veag-fags 

tronksya 

tronks-ya 

ujeshump 

u-je-shump 

vebobo 

ve-bo-bo 

trosnayn 

tros-nayn 

ujhejda 

uj-hej-da 

vedim 

ve-dim 

tryhow 

try-how 

ujvoba 

uj-vo-ba 

vedtwo 

ved-two 

tudrymm 

tu-drymm 

ukogchi 

u-kog-chi 

veejplo 

veej-plo 

tuduyen 

tu-du-yen 

ukuckody 

u-kuck-o-dy 

veemreur 

veem-reur 

tuesh 

tuesh 

ukyid 

uk-yid 

vefhyo 

ve f-hyo 

tufhaf 

tuf-haf 

ultic 

ul-tic 

vegcopco 

veg-cop-co 

tugdu 

tug-du 

undau 

un-dau 

vegewal 

ve-ge-wal 

tugoyu 

tu-go-yu 

unraksom 

un-raks-om 

vegontha 

ve-gon-tha 

tuivfu 

tu-iv-f  u 

untduaci 

unt-du-a-ci 

vegti 

veg-ti 

tujestty 

tu- jest-ty 

unvawth 

un-vawth 

vegwa 

veg-wa 

tukici 

tu-ki-ci 

unvuecad 

un-vue-cad 

vejlyen 

vej-lyen 

tul jtank 

tul j-tank 

uphdelyk 

uph-de-lyk 

veocplu 

ve-oc-plu 

tuownsmi 

tu-owns-mi 

uphij 

u-phij 

vetda 

vet-da 

tuthu 

tu-thu 

upsha 

up-sha 

ve tdeif 

vet-deif 

twe jut 

twe-jut 

urhodri 

ur-ho-dri 

vevma 

vev-ma 

twelyte 

twe-lyte 

urshfo 

ursh-fo 

viaja 

vi-a-ja 

twepvou 

twep-vou 

ushgat 

ush-gat 

viaklu 

vi-ak-lu 

twufoist 

twu-foist 

us juhy 

us-ju-hy 

vibdi ju 

vib-di-ju 

twyvaswo 

twy-va-swo 

uskcruk 

usk-cruk 

viclup 

vic-lup 

tyciaqua 

ty-ci-a-qua 

uskluds 

usk-luds 

viconow 

vi-co-now 

tylywiv 

ty-ly-wiv 

usktyse 

usk-tyse 

vi jeicbo 

vi-jeic-bo 

tyocle 

tyocle 

usspt 

usspt 

vi  jku 

vij-ku 

typayryo 

ty-pay-ryo 

usvuruij 

us-vu-ru-ij 

vill ja 

vill-ja 

typeaf 

ty-peaf 

utcerd 

ut-cerd 

vinomjo 

vi-nom-jo 

tytba 

tyt-ba 

utewade 

u-te-wade 

vinuhici 

vi-nu-hi-ci 

ubcedu 

ub-ce-du 

utwub 

u-twub 

viomlop 

vi-om-lop 

ubpiji 

ub-pi-ji 

uvnoosod 

uv-noo-sod 

virte 

virte 

ubpru 

ub-pru 

uvovveo 

u-vov-ve-o 

visjehy 

vis-je-hy 

ubwru 

ub-wru 

uvubsoi 

u-vub-soi 

vi sviens 

vis-viens 

ubybfrex 

u-byb-frex 

vab  ju 

vab-ju 

vitey fu 

vi-tey-fu 

ucdapcib 

uc-dap-cib 

vabriho 

va-bri-ho 

vobmocgi 

vob-moc-gi 

ucecai 

u-ce-cai 

vacquic 

vac-quic 

vociwen 

vo-ci-wen 

ucgij 

uc-gij 

vadfuo 

vad-f  u-o 

vogtho 

vog-tho 

uchiogid 

u-chi-o-gid 

vadwyhey 

vad-wy-hey 

voithiyo 

voi-thi-yo 

udaufmi 

u-dauf -mi 

vadyjo 

va-dy-jo 

voitmu 

voit-mu 

udodd 

u-dodd 

vaf jitdu 

vaf-jit-du 

vojcrigg 

voj-crigg 
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voldyu 

vol-dyu 

wiabko 

wi-ab-ko 

wucuca 

wu-cu-ca 

vopsbyk 

vops-byk 

widgawa 

wid-ga-wa 

wupkuamu 

wup-ku-a-mu 

vorgdra 

vorg-dra 

wieci 

wie-ci 

wurctivi 

wurc-ti-vi 

vortho 

vor-tho 

wief  u 

wie-fu 

wuyeawed 

wu-yea-wed 

vorwajel 

vor-wa-jel 

wifadal 

wi-fa-dal 

wybylel 

wy-by-lel 

voumdy 

voum-dy 

wifvenod 

wif-ve-nod 

wyclinoc 

wy-cli-noc 

vowgta 

vowg-ta 

wigawoz 

wi-ga-woz 

wygdrel 

wyg-drel 

voysblo 

voys-blo 

wigdosim 

wig-do-sim 

wykya 

wyk-ya 

vuaby 

vu-a-by 

wi jwoa 

wi j-woa 

wyojyilm 

wyoj-yilm 

vuccorer 

vuc-co-rer 

wimac 

wi -mac 

wyosevu 

wyo-se-vu 

vudrawm 

vu-drawm 

wimosu 

wi-mo-su 

wyrahu 

wy-ra-hu 

vufgoaja 

vuf-goa-ja 

winect 

wi-nect 

wyril 

wy-ril 

vulynri 

vu-lyn-ri 

wirro 

wir-ro 

wyvotded 

wy-vot-ded 

vunlusho 

vun-lu-sho 

witlugdu 

wit-lug-du 

yacrad 

yac-rad 

vupwoa 

vup-woa 

wivtuje 

wiv-tuje 

yafduhev 

yaf-du-hev 

vuquatu 

vu-qua-tu 

wiyei ju 

wi-yei-ju 

yafrig 

ya-frig 

vuyeevo 

vu-yee-vo 

wiyicyn 

wi-yi-cyn 

yaicjopo 

yaic-jo-po 

vuyovitt 

vu-yo-vitt 

wiyi jko 

wi-yi j-ko 

yaisi 

yai-si 

wabr  n 

wa-ban 

wizpio 

wiz-pi-o 

yaiyew 

yai-yew 

wacase 

wa-case 

wobloyky 

wo-bloyk-y 

yajtuv 

yaj-tuv 

wadyt 

wa-dyt 

woimnav 

woim-nav 

yakca 

yak-ca 

wag  my 

wag-my 

woiwue 

woi-wue 

yakvacgo 

yak-vac-go 

wandyb 

wan-dyb 

wojyu 

wo  j-yu 

yakyafa 

yak-ya-fa 

wapawwo 

wa-paw-wo 

woknut 

wo-knut 

yamlovna 

yam-lov-na 

warogi 

wa-ro-gi 

wonba 

won-ba 

yarayen 

ya-myen 

wavku 

wav-ku 

wonwi jmy 

won-wij-my 

yands 

yands 

wavsnu 

wav-snu 

wopho 

wo-pho 

yapegu 

ya-pe-gu 

weday 

we-day 

wopku 

wop-ku 

yar jam ju 

yar-jam-ju 

weenipho 

wee-ni-pho 

worlegig 

wor-le-gig 

yasbut 

yas-but 

wegoka 

we-go-ka 

wovru 

wov-ru 

yashgo 

yash-go 

wekbu 

wek-bu 

woy  fum 

woy-f urn 

yasspka 

yassp-ka 

wemekchi 

we-mek-chi 

woytaict 

woy-taict 

yayigca 

ya-yig-ca 

wepri 

wep-ri 

wrahiud 

wra-hi-ud 

yayjint 

yay-jint 

werho 

wer-ho 

wrahok 

wra-hok 

yebjo 

yeb-jo 

wetukfi 

we-tuk-fi 

wrarjhu 

wrarj-hu 

yeceag 

ye-ceag 

weutak 

weu-tak 

wreasgo 

wreas-go 

yecwrur 

yec-wrur 

whacfida 

whac-f i-da 

wr  egy 

wreg-y 

yefanaki 

ye-fa-na-ki 

whawjel 

whaw-jel 

wreng 

wreng 

yefeit 

ye-feit 

whedcli 

whed-cli 

wrerry 

wrer-ry 

yefhy 

yef-hy 

whetcrul 

whet-crul 

wri jtrur 

wrij-trur 

yefleoka 

ye-fle-o-ka 

whokyohu 

whok-yo-hu 

wrocpish 

wroc-pish 

yegoce 

ye-goce 

whosdeb 

whos-deb 

wroije 

wroije 

yejut 

ye-jut 

whoyhyo 

whoy-hyo 

wruvpi 

wruv-pi 

yekcoghu 

yek-cog-hu 

whyabdi 

whyab-di 

wry fsru 

wryfs-ru 

yekcu 

yek-cu 

why  je 

why  je 

wuajqui 

wu-aj-qui 

yekjiva 

yek-ji-va 

whyshiv 

why-shiv 

wuavsyo 

wu-av-syo 

yenrit 

yen-rit 
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yepros  yep-ros 
yerelifs  ye-re-lifs 
yesfryac  yes-fryac 
yesps  yesps 

ye tmawoi  yet-ma-woi 
yevon  ye-von 

yewlibol  yew-li-bol 
yewnshlo  yewn-shlo 
yexpli  yex-pli 
yibfeem  yib-feem 
yibluv  yib-luv 
yibnen  yib-nen 
yiccoha  yic-co-ha 
yictryti  yict-ry-ti 
yidbogdo  yid-bog-do 
yidjetli  yid-jet-li 
yiemtak  yiem-tak 
yigfrale  yig-frale 
yijwis  yij-wis 
yikeej  yi-keej 
yiklo  yik-lo 

yilak  yi-lak 

yilvdu  yilv-du 


yipweksa 

yip-weks-a 

yitjeyef 

yit-je-yef 

yitnes ju 

yit-nes-ju 

yivgadso 

yiv-gads-o 

yivos 

yi-vos 

yivvi 

yiv-vi 

yoarmoi 

yoar-moi 

yobcruro 

yob-cru-ro 

yocefkan 

yo-cef-kan 

yocgi 

yoc-gi 

yocuct 

yo-cuct 

yogieth 

yo-gieth 

yogway 

yog-way 

yombo 

yora-bo 

yomciub 

yom-ci-ub 

yonmeg 

yon-meg 

yopojvob 

yo-poj-vob 

yororcdi 

yo-rorc-di 

yotylica 

yo-ty-li-ca 

yovshmu 

yov-shmu 

yoyweufa 

yoy-weu-fa 

yuebkovu 

yueb-ko-vu 

yuetyo 

yue-tyo 

yufugha 

yu-fu-gha 

yugmopyz 

yug-mo-pyz 

yuhienfa 

yu-hien-fa 

yuitdib 

yu-it-dib 

yunny 

yunn-y 

yuondyd 

yu-on-dyd 

yupshty 

yup-shty 

yupwi jat 

yup-wi-jat 

yureppri 

yu-repp-ri 

yurytha 

yu-ry-tha 

yuyitboc 

yu-yit-boc 

zaney 

za-ney 

zatshdu 

zatsh-du 

zatvel 

zat-vel 

zefphlie 

zef-phlie 

zeyel 

ze-yel 

zodagbo 

zo-dag-bo 

zujuce 

zu- juce 

zwiufe 

zwi-ufe 

zwudrul 

zwu-drul 

zyipunpa 

zy-i-pun-pa 

zylocwyt 

zy-loc-wyt 
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STATISTICS 


In  Section  IV  complete  data  was  presented  for  an  analysis  of  ran- 
dom words  of  eight  letters.  The  two  figures  in  this  appendix  are  the 
probability  distribution  curves  for  words  of  six  and  ten  letters,  sim- 
ilar to  that  shown  in  figure  4 for  eight  letter  words. 
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Figure  6.  Distribution  of  Probabilities  of  5893  Six  Letter  Words 
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Figure  7.  Distribution  of  Probabilities  of  1039  Ten  Letter  Words 
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DOCUMENTATION 


The  documentation  on  the  programs  contained  in  this  appendix  is 
in  the  form  of  Multics  Programmers'  Manual  command  and  subroutine  de- 
scriptions. [3]  The  individual  write-ups  are  in  alphabetical  order. 
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columns 


Command 


Name : columns,  col 

The  columns  command  will  reformat  a segment  consisting  of  short 
lines  into  columns  that  read  vertically  down  the  page.  The  number  of 
columns  that  will  appear  depends  on  the  length  of  the  longest  line  in 
the  segment  and  the  length  of  the  output  line.  The  reformatted 
segment  is  printed  on  the  terminal  with  blank  lines  between  pages,  or 
can  be  stored  for  dprinting. 

Usage 


columns  pathname  -control_args- 

1)  pathname  Name  of  the  segment  to  be  reformatted.  Lines  in  the 

segment  may  be  of  any  length  up  to  132  characters, 
and  all  lines  need  not  be  the  same  length. 

2)  control_args  are  one  or  more  of  the  following: 

-segment,  -sm  If  present,  this  argument  specifies  that  the 
reformatted  segment  will  be  stored  in  a segment 
called  pathname . columns,  in  a format  suitable  for 
dprint.  If  this  argument  does  not  appear,  output 
will  be  printed  on  the  terminal. 

-line_lenth  nn,  -11  nn 

If  present,  nn  is  the  length  of  line  to  be  used  for 
output.  This  must  be  a number  in  the  range  1 to  132. 
If  this  argument  is  missing,  the  length  of  line  used 
will  be  132  for  the  -segment  option,  or  the  length  of 
the  terminal  output  line  if  -segment  is  not 
specified.  If  the  user  is  absentee  or  file_output  is 
being  used,  a length  of  132  will  be  used. 

-page_length  nn,  -pi  nn 

This  argument  sets  the  length  of  the  page  produced  by 
columns.  If  output  is  to  the  terminal,  nn  lines  will 
be  printed  on  each  page,  and  blank  lines  will  be  used 
to  pad  the  end  of  each  page  up  to  a total  of  66  lines 
per  page.  If  output  is  to  a segment,  a page  will  be 
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ejected  every  nn  lines.  The  default  page  length  is 
60  lines.  This  control  argument  is  incompatible  with 
the  -no_pagi nation  control  argument. 

-adjust,  -ad  specifies  that  the  blank  space  between  the  columns  is 
to  be  adjusted  so  that  the  maximum  amount  of  white 
space  appears  between  the  columns.  If  this  control 
argument  does  not  appear,  there  will  always  be  one 
blank  space  between  the  columns.  Note  that  the 
number  of  columns  to  be  printed  is  not  affected  by 
the  use  of  this  control  argument.  The  only  effect  is 
to  possibly  add  some  blanks  between  the  columns  that 
would  otherwise  appear  at  the  end  of  each  line  of 
output. 

-no_pagi nation,  -npgn 

This  argument  specifies  that  the  output  is  not  to  be 
paged,  i.e.,  the  page  length  is  assumed  to  be 
infinite.  This  argument  is  useful  for  terminal 
output  when  page  breaks  are  not  desired,  and  avoids 
extra  blank  lines  at  the  end  of  the  last  page. 

Notes 


The  command  first  determines  how  many  columns  can  fit  on  a line 
by  scanning  the  segment  for  the  longest  line,  and  using  that  length  as 
the  width  of  each  column.  If  -adjust  is  not  specified,  there  will  be 
one  blank  space  between  columns,  otherwise  any  extra  space  will  be 
inserted  between  the  columns.  Lines  from  the  input  segment  that  are 
shorter  than  the  longest  line  will  be  left  justified  in  the  columns. 

When  the  -segment  option  is  not  specified  (and  -no_pagi nation  is 
not  specified),  columns  will  put  60  lines  per  page,  with  3 blank  lines 
at  the  top  and  bottom  of  each  page.  When  -segment  is  specified 
without  -no_pagi nation,  there  will  be  60  lines  per  page  with  no  blank 
lines  between  pages  (NEWPAGE  characters  are  inserted  into  the  output 
segment  to  eject  a page  when  dprinting). 

When  dprinting  the  output  segment,  the  -no_endpage  option  should 
be  specified  for  the  dprint  command.  This  is  necessary  to  avoid  extra 
blank  pages  because  columns  formats  its  own  pages. 
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Warning 

This  command  expands  tabs  into  spaces  properly,  but  treats 
backspaces  and  all  other  control  characters  as  single  characters. 
Generally,  if  the  segment  contains  any  control  characters  (other  than 
tabs  and  newlines)  the  columns  on  the  output  will  not  line  up 
properly. 
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Subroutine 


Name : convert_word_ 

This  subroutine  is  used  to  convert  the  random  word  array  returned 
by  random_word_  to  ASCII. 

Usage 


del  convert^ word_  entry  ((0:*)  fixed  bin,  (0:*)  bit(1)  aligned, 
fixed  bin,  char(*),  char(*)); 

call  convert_word_  (word,  hyphenated_word,  word_length, 
ascii_word,  ascii_hyphenated_word) ; 

1)  word  Array  of  random  units  returned  from  a previous  call  to 

random_word_.  ( Input) 

2)  hyphenated_word 

Array  of  bits  indicating  where  hyphens  are  to  be 
placed,  returned  from  random_word_.  (Input) 

3)  word_length  Number  of  units  in  word,  returned  from  random_word_. 

( Input) 

4)  ascii_word  This  string  will  contain  the  word,  left  justified,  with 

trailing  blanks.  This  string  should  be  long  enough  to 
hold  the  longest  word  that  may  be  returned.  This  is 
normally  the  value  of  "maximum"  supplied  to 
random_word_.  (Output) 

5 ) ascii_hyphenated_word 

This  string  will  contain  the  word,  with  hyphens  between 
the  syllables,  left  justified  within  the  string.  The 
length  of  this  string  should  be  at  least  3*maximum/2+1 
to  guarantee  that  the  hyphenated  word  will  fit. 
(Output) 

Entry:  c on ve rt_wo r d_$ n o_hy phen s 

This  entry  can  be  used  to  obtain  the  ASCII  form  of  a random  word 
without  the  hyphenated  form. 
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Usage 

del  convert_word_$no_hyphens  ((0:*)  fixed  bin,  fixed  bin, 
char( *) ) ; 

call  convert_word_$no_hyphens  (word,  word_length,  ascii_word); 
Arguments  are  the  same  as  above. 
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Subroutine 

Name : convert_word_ 

This  subroutine  facilitates  printing  of  the  hyphenated  word 
returned  from  a call  to  hyphenate^. 

Usage 

del  convert^ word_char_  entry  (char(*),  (*)  bi t ( 1 ) aligned,  fixed 
bin,  char(*)  varying); 

call  convert_word_char_  (word,  hyphens,  last,  result); 


1 ) word 

This  string  is  the  word  to  be  hyphenated.  (Input) 

2)  hyphens 

This  is  the  array  returned  from  a call  to  hyphenate_ 
that  marks  characters  in  word  after  which  hyphens  are 
to  be  inserted.  (Input) 

3)  last 

This  is  the  status  code  returned  from  hyphenate_.  If 
negative,  the  result  will  be  the  original  word, 
unhyphenated,  with  **  following  it.  If  positive,  the 
word  will  be  returned  hyphenated,  but  with  an  asterisk 
preceding  the  last'th  character.  If  zero,  the  word 
will  be  returned  hyphenated  without  any  asterisks. 
(Input) 

4)  result 

This  string  contains  the  resultant  hyphenated  word. 
(Output ) 
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Command 


Name:  digram^ table_compiler , dtc 

This  command  compiles  a source  segment  containing  the  digrams  for 
the  random  word  generator  and  produces  an  object  segment  with  the  name 
"digrams" . 

Usage 


digram_table_compiler  pathname  -option- 


1 ) pathname 


2)  -option- 
-list,  -Is 


is  the  pathname  of  the  source  segment.  If  the 
suffix  ".dtc"  does  not  appear,  it  will  be  assumed. 
Regardless  of  the  name  of  the  source  segment,  the 
output  segment  will  always  be  given  the  name 
"digrams"  and  will  be  placed  in  the  working 
directory. 

may  be  the  following: 

lists  the  compiled  table  on  the  terminal.  The 
table  will  be  printed  in  columns  to  fit  the 
terminal  line  length.  If  file_output  is  being 
used,  lines  will  be  132  characters  long. 


-list  n,  -Is  n lists  the  table  as  above,  but  uses  n as  the  number 
of  columns  to  print.  Each  column  occupies  14 
positions,  thus  a value  of  5 will  cause  5 columns 
to  be  printed,  each  line  being  70  characters  long. 
This  option  is  useful  when  file_output  is  being 
used,  so  that  the  lines  produced  are  not  too  long 
to  fit  on  the  terminal  to  be  used  to  print  the 
output  file. 


Notes 


The  compiler  makes  an  attempt  to  detect  inconsistent  combinations 
of  attributes,  as  well  as  syntax  errors.  If  an  error  is  encountered 
during  compilation,  processing  of  the  source  segment  will  continue  if 
possible.  The  digrams  segment  in  case  of  an  error  will  be  left  in  an 
undefined  state. 
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During  compilation,  the  ALM  assembler  is  used..  At  that  point  the 
letters  "ALM"  will  be  printed  on  the  terminal.  If  compilation  was 
successful,  no  other  messages  should  appear. 

The  listing  produced  by  digram_table_compiler  is  in  a format 
suitable  for  printing  on  the  terminal  --  not  for  dprinting.  This  is 
because  blank  lines  are  used  for  page  breaks,  instead  of  the  "new 
page"  character  as  recognized  by  dprint. 

Syntax 

The  syntax  of  the  source  segment  is  specified  below.  Spaces  are 
meaningful  to  this  compiler  and  a space  is  only  allowed  where 
specified  as  <space>.  The  new  line  character  is  indicated  as 
<new  line>. 


<digram  table>::=  <unit  specs>;[<new  line> ] . . . <digram  specs>$ 

<unit  specs>::=  <unit  spec> [ <delim><unit  spec>]... 

<digram  specs>::=  <digram  spec>[<delim><digram  spec>]... 

<delim>:  :=  ,[<new  line>]'<new  line> 

<unit  spec>: :=  <unit  name>[<not  begin  syllable>[<no  final  split>]] 
<digram  spec>::=  [<beginXnot  beginXbreakXprefix>] 

<unit  nameXunit  name>[<suffix>[<end>[<not  end>]]] 
<unit  name>::=  <letter>[<letter>] 

<letter>: :=  a!bic!die!f!g!hiiijik!lim!nio!piq!rjsit$uiviw!x!y!z 
<not  begin  syllable>::=  <bit> 

<no  final  split>::=  <bit> 

<begin>::=  <bit> 

<not  begin>: :=  <bit> 

<break>::=  <bit> 

<pr ef ix> : : = <space> ! - 
<suffix>::=  <space>i-j+ 

<end>: :=  <bit> 

<not  end>::=  <bit> 

<bit> : :=  <space> ! 1 


The  first  part  of  the  <digram  table>  consists  of  definitions  of 
the  various  units  that  are  to  be  used  and  their  attributes.  The  units 
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are  defined  as  one  or  two-letter  pairs,  and  the  order  in  which  they 
are  defined  is  unimportant.  For  each  unit,  the  attributes  <not 
begin  syllable>  and  <no  final  split>  may  be  specified.  In  addition, 
if  <unit  name>  is  a,  e,  i,  o,  or  u,  the  "vowel"  attribute  is  set.  If 
the  unit  is  y,  the  "alternate  vowel"  attribute  is  set.  A <bit>  is 
assumed  to  be  zero  if  specified  as  <spaee>,  or  one  if  specified  as  1. 

The  second  part  of  <digram  table>  specifies  all  possible  pairs  of 
units  and  the  attributes  for  each  pair.  The  order  in  which  these 
pairs  must  be  specified  depends  on  the  order  of  the  Cunit  specs>  as 
follows: 

Number  the  Cunit  spec>s  from  1 to  n in  the  order  in  which  they 
appeared  in  Cunit  specs>.  The  first  Cdigram  spec>  must  consist  of  the 
pair  of  units  numbered  (1,1),  the  second  Cdigram  spec>  is  the  pair 

(1,2),  etc.,  and  the  last  Cdigram  spec>  is  the  pair  (n,n).  All  pairs 

must  be  specified,  i.e.,  there  must  be  n*n  Cdigram  spec>s.  The  Cbit>s 
preceding  or  following  each  pair  set  the  attributes  for  that  pair  as 
shown.  The  Cprefix>  and  Csuffix>  indicators  are  set  to  1 if  specified 
as  If  Csuffix>  is  specified  as  "+",  the  "illegal  pair"  indicator 

will  be  set,  and  no  other  attributes  may  be  specified  for  that 
Cdigram  spec>. 

Example 

The  following  is  a very  short  example  of  a Cdigram  table>.  Only 
four  units  are  defined,  "a",  "b" , "sh"  and  "e" . The  letter  "e"  is 
given  the  "no  final  split"  attribute,  the  pair  "aa"  is  given 

"illegal  pair",  the  pair  "ae"  is  given  the  "not  begin",  "break",  and 

"not  end"  attributes,  etc. 

a, b,sh,e  1 ; 
aa+,ab,ash,  1 1 ae  1 
ba,  1 bb,  11  bsh  1,be 
sha,  11  shb  1 ,shsh+,she,ea,eb,esh,ee 
$ 


Assume  the  above  segment  was  named  "dt.dtc".  Below  is  an  example 
of  the  command  used  to  compile  and  list  the  table  produced  for  dt. 
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digram_table_compiler  dt  -Is 
ALM 


1 a 0010  2 b 0000 


000 

aa 

+00 

000 

ba 

00 

000 

ab 

00 

010 

bb 

00 

000 

ash 

00 

011 

bsh 

01 

011 

ae 

01 

000 

be 

00 

3 sh  0000  4 e 0110 


000 

sha 

00 

000 

ea 

00 

011 

shb 

01 

000 

eb 

00 

000 

shsh+00 

000 

esh 

00 

000 

she 

00 

000 

ee 

00 

The  first  line  of  output  lists  the  individual  units.  The  number 
preceding  the  unit  is  the  unit  index.  The  four  bits  following  the 
unit  are  respectively: 

not  begin  syllable 
no  final  split 
vowel 

alternate  vowel 

Following  the  unit  specifications  are  the  digram  specifications. 
Preceding  each  digram  are  three  bits  and  a space  (or  possibly  a 
with  meanings  corresponding  to  those  specified  in  the  source  segment 
as  follows: 

begin 
not  begin 
break 

prefix  (if  appears) 

Immediately  following  each  digram  is  a field  which  may  be  blank, 
or  "+".  If  , the  "illegal  pair"  flag  is  set.  Otherwise,  the 

meaning  of  the  "-"  and  following  two  bits  are  as  follows: 

suffix  (if  appears) 

end 

not  end 
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Name:  pr int_digram_table,  pdt 

This  entry  merely  prints  the  digram  table  on  the  terminal, 
assuming  that  it  has  already  been  compiled  successfully.  The  segment 
"digrams"  is  assumed  to  be  located  in  the  working  directory. 

Usage 


print_digram_table  -n- 

1)  n is  the  number  of  columns  in  which  to  print  the  table.  If 

not  specified,  the  maximum  number  of  columns  that  will  fit 
in  the  terminal  line  will  be  used.  Each  column  occupies  14 
positions.  If  file_output  is  being  used,  the  terminal  line 
width  is  assumed  to  be  132. 

Notes 


This  entry  performs  the  same  function  as  the  -list  option  of 
digram_table_compiler . 
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Subroutine 


Name:  generate_word_ 

This  subroutine  returns  a random  pronounceable  word  as  an  ASCII 
character  string.  It  also  returns  the  same  word  split  by  hyphens  into 
syllables  as  an  aid  to  pronunciation. 

Usage 


declare  generate_word_  entry  (char(*),  char(*),  fixed  bin,  fixed 
bin) ; 

call  generate_word_  (word,  hyphenated_word,  min,  max); 

1)  word  is  the  random  word,  padded  on  the  right  with 

blanks.  This  string  must  be  long  enough  to  hold 
the  word  (at  least  as  long  as  max).  (Output) 

2)  hyphenat ed_word  is  the  same  word  split  into  syllables.  The  length 

of  this  string  must  be  greater  than  max  to  allow 
for  the  hyphens.  A length  of  3*max/2  + 1 will 
always  be  sufficient.  (Output) 

3)  nin  is  the  minimum  length  of  the  word  to  be  generated. 

This  value  must  be  greater  than  3 and  less  than 
21.  (Input) 

4)  max  is  the  maximum  length  of  the  word  to  be  generated. 

The  actual  length  of  the  word  will  be  uniformly 
random  between  min  and  max.  The  value  of  max  must 
be  greater  then  or  equal  to  min,  and  less  than  21. 
(Input) 

Note 


Each  call  to  generate_word_  should  produce  a different  random 
word,  regardless  of  when  the  call  is  made.  However,  as  with  any 
random  generator,  there  is  no  guarantee  that  there  will  be  no 
duplicates.  The  probability  of  duplication  is  greater  with  shorter 
wo rds. 
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Entry:  generate_word_$init_seed 

This  entry  allows  the  user  to  specify  a starting  seed  for 
generating  random  words.  If  a seed  is  specified,  the  exact  same 
sequence  of  random  words  will  always  be  generated  on  subsequent  calls 
to  generate_word_  providing  the  same  values  of  min  and  max  are 
specified.  If  this  entry  is  not  called  in  a process,  the  value  of  the 
clock  is  used  as  the  initial  seed  on  the  first  call  to  generate_word_, 
thereby  "guaranteeing"  different  sequences  of  words  in  different 
processes. 

Usage 


declare  generate_word_$init_seed  entry  (fixed  bin(35)); 

call  generate_word_$init_seed  (seed); 

1)  seed  is  the  initial  seed  value.  If  zero,  the  system  clock 

will  be  used  as  the  seed.  (Input) 
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Command 


Name : generate_words,  gw 

This  command  will  print  random  pronounceable  "words"  on  the 
user's  terminal. 

Usage 


generate_words  -control_args- 
1)  control_args  may  be  selected  from  the  following: 

nwords  is  the  number  of  words  to  print.  If  not  specified, 

one  word  is  printed. 

-min  n^  specifies  the  minimum  length,  in  characters,  of  the 

words  to  be  generated. 

-max  n.  specifies  the  maximum  length  of  the  words  to  be 

generated . 

-length  n,  -In  n_  specifies  the  length  of  the  words  to  be  generated. 

If  this  argument  is  specified,  all  words  will  be 
this  length,  and  -min  or  -max  may  not  be  specified. 

-hyphenate,  -hph  causes  the  hyphenated  form  (divided  into  syllables) 

of  each  word  to  be  printed  alongside  the  original 
word. 

-seed  SEED  On  the  first  call  to  generate_words  in  a process, 

the  system  clock  is  used  to  obtain  a starting 

"seed"  for  generating  random  words.  This  seed  is 
updated  for  every  word  generated,  and  subsequent 
values  of  the  seed  depend  on  previous  values  (in  a 
rather  complex  way).  If  the  -seed  argument  is 

specified,  SEED  must  be  a positive  decimal  integer. 
For  a given  value  of  SEED,  the  sequence  of  random 
words  will  always  be  the  same  providing  the  same 
length  values  are  specified.  When  no  -seed 

argument  is  specified,  the  last  value  of  the 
updated  seed  from  the  previous  call  to 


149 


generate_words 


MULTICS  PROGRAMMERS'  MANUAL 


Page  2 

generate_words  will  be  used.  To  revert  back  to 
using  the  system  clock  as  the  seed,  specify  a zero 
value  for  SEED,  i.e.,  -seed  0. 

Notes 


If  neither  -min,  -max,  nor  -length  are  specified,  the  defaults 
are  -min  6 and  -max  8.  In  all  other  cases,  the  defaults  are  -min  4 
and  -max  20. 

If  -length  is  not  specified,  the  lengths  of  the  random  words  will 
be  uniformly  distributed  between  min  and  max.  Words  generated  are 
printed  one  per  line,  with  the  hyphenated  forms,  if  specified,  lined 
up  in  a column  alongside  the  original  words. 
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Command/Active  Function 


Name:  get_line_length,  gll 

This  command  or  active  function  returns  the  current  length  of  a 
line  on  the  stream  user_output.  When  used  as  a command,  the  line 
length  is  printed  on  the  terminal.  When  used  as  an  active  function, 
the  line  length  is  returned  as  a character  string. 

Usage 


get_line_length 

[get_line_length] 


Notes 


The  length  of  the  line  is  obtained  from  the  modes  supplied  to  the 
tw_  IOSIM.  If  user_output  is  attached  through  some  other  interface 
module  (such  as  file_  when  using  file_output)  the  line  length  may  be 
undefined.  If  the  line  length  can  not  be  obtained,  a default  length 
of  132  will  be  returned. 
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Subroutine 


Name : get_line_length_ 

This  subroutine  returns  the  current  length  of  a line  on  the 
stream  user_output. 

Usage 

del  get_line_length_  entry  returns  (fixed  bin); 

11  = get_line_length_  (); 

1)  11  is  the  length  of  the  line. 

Notes 


The  length  of  the  line  is  obtained  from  the  modes  supplied  to  the 
tw_  IOSIM.  If  user_output  is  attached  through  some  other  interface 
module  (such  as  file_  when  using  file_output)  the  line  length  may  be 
undefined.  If  the  line  length  can  not  be  obtained,  a default  length 
of  132  will  be  returned. 


153 





MULTICS  PROGRAMMERS'  MANUAL 


hyphen_test 


Command 


Names : hyphen_test,  ht 

This  command  uses  the  random  word  generator  (the  same  one  used  by 
generate_words ) to  divide  words  into  syllables.  Words  are  printed  on 
the  terminal  with  hyphens  between  the  syllables. 

Usage 


hyphen_test  -control_arg-  -wordJ_-  . . . -wordn- 


1)  control_arg  may  be  -probability  (-pb),  specifying  that  the 

probability  of  each  of  the  words  that  follows  be 
printed  alongside  the  hyphenated  word. 

2)  wordi.  are  one  or  more  words  to  be  hyphenated.  A word 

may  consist  of  three  to  twenty  alphabetic 
characters,  only  the  first  of  which  may  be 
uppercase. 

Notes 


The  control  argument  may  appear  anywhere  in  the  command  line. 
However,  it  only  applies  to  words  that  follow.  Words  preceding  the 
option  will  be  hyphenated  but  no  probabilities  will  be  calculated. 


If  a word  contains  any  illegal  characters,  or  is  not  of  three  to 
twenty  characters  in  length,  the  word  will  be  printed  unhyphenated, 
followed  by  **. 


If  the  word  could  not  be  completely  hyphenated  because  it  was 
considered  unpronounceable,  an  asterisk  (*)  will  be  printed  out  in 
front  of  the  first  character  that  was  not  accepted.  The  part  of  the 
word  before  the  asterisk  will  be  properly  hyphenated. 

The  calculated  probability  is  the  probability  that  the  word  would 
have  been  generated  by  generate_words,  assuming  generate_words  was 
requested  , to  generate  a word  of  that  length  only.  If  a range  of 
lengths  is  requested  of  generate__ words,  each  length  has  equal 
probability.  For  example,  if  generate_words  is  called  to  generate 

words  of  6,  7,  or  8 characters,  there  is  a 33%  probability  that  a 
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given  word  will  have  8 characters.  If  hyphen_test  is  then  asked  to 
calculate  the  probability  of  a given  8 letter  word,  that  probability 
should  be  divided  by  3 to  obtain  the  correct  probability  for  the  case 
of  three  possible  lengths. 
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Subroutine 


Name : hyphenate_ 

This  subroutine  attempts  to  hyphenate  a word  into  syllables. 


Usage 


del  hyphenate^  entry  (char(*),  (*)  bit(1)  aligned,  fixed  bin); 
call  hyphenate_  (word,  hyphens,  code); 


1 ) word 


2)  hyphens 


3)  code 


Notes 


This  is  a left  justified  ASCII  string,  3 to  20 
characters  in  length.  This  string  must  contain  all 
lowercase  alphabetic  characters,  except  the  first 
character  may  be  uppercase.  Trailing  blanks  are  not 
permitted  in  this  string.  (Input) 

This  array  will  contain  a M 1Hb  for  every  character  in 
the  word  that  is  to  have  a hyphen  following  it. 
(Output) 

This  is  a status  code,  as  follows: 

0 word  has  been  successfully  hyphenated. 

-1  word  contains  illegal  (non  alphabetic  or 
uppercase ) characters . 

-2  word  was  not  from  three  to  twenty  characters  in 
length. 

Any  positive  value  of  code  means  that  the  word  couldn't 
be  completely  hyphenated.  In  this  case,  code  is  the 
position  of  the  first  character  in  word  that  was  not 
acceptable.  The  part  of  the  word  before  code  will  be 
properly  hyphenated.  (Output) 


This  subroutine  uses  random_word_  to  provide  the  hyphenation.  It 
does  this  by  calling  random_word_$give_up  and  supplying  its  own 
version  of  random_unit  and  random_vowel  that  return  specified  units 
(of  the  particular  word  to  be  hyphenated)  instead  of  random  units. 
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The  word  supplied  to  hyphenate_  is  first  transformed  into  units 
by  translating  pairs  of  letters  into  single  units  if  a 2-letter  unit 
is  defined  for  the  pair,  and  then  by  translating  the  remaining  single 
letters  into  units.  See  the  write-ups  of  random_word_  and 
random_unit_  for  a description  of  units.  If  any  units  of  the  word  are 
refused  by  random_word_,  hyphenate  tries  to  determine  if  the  refused 
unit  was  a 2-letter  unit.  If  this  is  the  case,  then  the  2-letter  unit 
is  broken  into  two  1-letter  units  and  random_word_  is  called  again. 
In  rare  cases,  hyphenate_  is  not  able  to  determine  which  2-letter  unit 
is  at  fault,  and  will  return  a status  code  indicating  that  the  word  is 
unpronounceable,  when,  in  fact,  it  could  have  been  properly  divided  by 
breaking  up  a 2-letter  unit. 

Entry:  hyphenate_$ probability 

This  entry  returns  information  as  above,  but  also  supplies  the 
probability  of  the  word  having  been  generated  at  random  by 
generate_word_  or  random  word  generator  . The  assumption  is  made  that 
generate_word_  or  random  word  generator  was  asked  to  supply  a word  of 
exactly  the  same  length  as  the  word  given  to  hyphenate_,  rather  than  a 
range  of  lengths.  If  a range  of  lengths  was  asked  of  generate_word_, 
the  probability  must  be  divided  by  the  number  of  different  lengths 
(all  lengths  are  equally  probable). 

Usage 


del  hyphenate_  entry  (char(*),  (*)  bi t ( 1 ) aligned,  fixed  bin, 
float  bin); 

call  hyphenate_  (word,  hyphens,  code,  probability); 

1)  to  3)  are  as  above. 

4)  probability  is  the  probability  as  defined  above.  (Output) 
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Notes 


If  the  supplied  word  is  illegal  (i.e.  code  is  not  zero),  the 
probability  will  be  returned  as  zero. 

Entry:  hy phenate_$debug_ on , hyphenate_$debug__of f 

These  entries  set  and  reset  a switch  that  causes 
hyphenate_$probabili ty  to  print,  on  user_output,  all  units  (see 
r andom_word__  and  random__unit_  for  a description  of  units)  that  are 
illegal  in  a given  position  of  the  word.  This  entry  is  useful  for 
debugging  a digram  table  for  randora__word_. 

Usage 


del  hyphenate_$debug_on  entry; 
del  hyphenate_$debug_of f entry; 

call  hyphenate_$debug_on ; 
call  hyphenate^ $debug_of f ; 


Notes 


An  example  of  the  output  produced  is  as  follows.  The  assumption 
is  that  hyphenate_$probabili ty  is  invoked  by  the 

hyphen_test  command  using  the  -probability  option. 

hyphenate_$debug_on 
hyphen_test  -probability  fish 

x,ck,f ; b,c,d,f,g,h,j,k,m,n,p,s,t,v,w,x,y,z,ch,gh,ph, 
rh,sh, th, wh,qu,ck ,i;  i , rh, wh, qu, sh; 
fish  6.04l27576e-5 

In  the  above  example,  the  units  x and  ck  are  shown  to  have  been 
illegal  as  the  first  unit  of  the  word,  and  the  unit  f.,  (underlined)  is 
the  first-  unit  of  the  word  that  was  accepted.  All  other  units  that 
were  not  printed  are  legal  as  the  first  unit  of  the  word.  Following 
the  semicolon  after  f_  are  the  units  that  are  illegal  in  the  second 
position  of  the  word  (assuming  that  f is  the  first  unit).  Then  i.  is 
shown  as  the  legal  unit  that  is  taken  from  the  word  "fish”.  This 
repeats  for  each  position  of  the  word,  ending  in  the  legal  unit  sh 
(note  only  one  underline). 


159 


hyphenate. 


MULTICS  PROGRAMMERS'  MANUAL 


Page  4 


If  the  supplied  word  is  illegal,  the  last  underlined  letter  in 
the  output  is  (usually)  the  letter  that  was  not  accepted.  In  cases 
where  hyphenate_  has  to  split  up  a 2-letter  unit,  the  word  will  be 
shown  to  start  over  from  the  beginning. 
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Name:  random_unit_ 

This  subroutine  provides  a random  unit  number  for  random_word_ 
based  on  a standard  distribution  of  a given  set  of  units.  It  is 
referenced  by  the  generate_word_  subroutine  as  an  entry  value  that  is 
passed  in  the  call  to  random_word_.  This  subroutine  assumes  that  the 
digram  table  being  used  by  random_word_  is  a standard  table.  The 
digram  table  itself  is  not  referenced  by  this  subroutine. 

Usage 


declare  random_unit__  entry  (fixed  bin); 
call  random_unit_  (unit); 

1)  unit  is  a number  from  1 to  3^  that  corresponds  to  a particular 
unit  as  listed  in  Notes  below.  (Output) 


Notes 


The  table  below  contains  the  units  that  are  assumed  specified  in 
the  digrams  supplied  to  random_word_.  Shown  in  the  table  are  the  unit 
number,  the  letter  or  letters  that  unit  represents,  and  the 
probability  of  that  unit  number  being  generated. 


1 

a .04739 

8 

h .02844 

15 

0 .04739 

22 

w .03792 

29 

rh  .00474 

2 

b .03792 

9 

i .04739 

16 

p .02844 

23 

x .00474 

30 

sh  .00948 

3 

c .05687 

10 

j .03792 

17 

r .04739 

24 

y .03792 

31 

th  .00948 

4 

d .05687 

1 1 

k .03792 

18 

s .03792 

25 

z .00474 

32 

wh  .00474 

5 

e .05687 

12 

1 .02844 

19 

t .04739 

26 

ch  .00474 

33 

qu  .00474 

6 

f .03792 

13 

m .02844 

20 

u .02844 

27 

gh  .00474 

34 

ck  .00474 

7 

g .03792 

14 

n .04739 

21 

v .03792 

28 

ph  .00474 

Entry:  random_unit_$random_vowel 

This  entry  returns  a vowel  unit  number  only. 


Usage 


declare  random_unit_$random_vowel  (fixed  bin); 
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call  random_unit_$random_vowel  (unit); 

1)  unit  As  above.  (Output) 

Notes 

Below  are  listed  the  vowel  units  and  their  distributions. 

1 a .167 
5 e .250 
9 i .167 
15  o .167 
20  u .167 
2M  y .083 

Entry:  random_unit_$probabilities 

This  entry  returns  arrays  containing  the  probabilities  of  the 
units  as  listed  in  the  table  on  the  previous  page.  This  entry  is 
provided  for  hyphenate_$ probability  and  any  other  program  that  might 
require  this  information.  The  probabilities  must  be  computed  when 
this  entry  is  called,  so  it  is  suggested  that  the  call  be  made  only 
once  per  process  and  the  values  saved  in  internal  static  storage. 

Usage 


declare  random_unit_$probabili ties  entry  ((*)  float  bin, 

(*)  float  bin); 

call  ran dom_unit_$probabili ties  (unit_probs,  vowel_probs) ; 

1)  unit_probs  This  array  contains  the  probabilities  of  the  individual 

units  assuming  the  random_unit_  entry  is  called  to 
generate  the  random  units.  The  value  of  unit_probs(i) 
is  the  probability  of  unit(i).  (Output) 

2)  vowel_probs  This  array  contains  the  probabilities  of  the  units  when 

random_vowel  is  called.  Since  there  are  only  6 vowels, 
most  of  these  values  will  be  zero.  (Output) 
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Notes 


A future  version  of  random__unit__  may  use  different  units  with 
different  probabilities.  The  size  of  the  two  arrays  must  be  large 
enough  to  hold  the  maximum  number  of  values  that  may  be  returned  by 
random_junit_  (which  is  currently  3*0*  Programs  should  not  depend  on 
the  unit__index-to-letter  correspondence  as  shown  in  the  table.  This 
information  can  be  obtained  by  using  the  include  file 
digram__structure.  incl.pl  1 . 


t 
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Subroutine 


Name : random_word_ 

This  routine  returns  a single  random  pronounceable  word  of 
specified  length.  It  is  called  by  generate_word_,  and  allows  the 
caller  to  specify  the  particular  subroutines  to  be  used  to  generate 
random  units.  For  users  desiring  random  words  with  an  English-like 
distribution  of  letters,  generate_word_  should  be  used. 

Usage 


del  random_word_  entry  ((0:*)  fixed,  (0:*)  bit(1)  aligned,  fixed, 
fixed,  entry,  entry); 

call  random_word_(word,  hyphens,  char_length,  unit_length, 
random_unit,  random_vowel) ; 

The  random  word  will  be  stored  in  this  array  starting 
at  word(1)  (word(0)  will  always  be  0).  The  numbers 
stored  will  correspond  to  a "unit  index"  as  described 
in  Notes  below.  This  array  must  have  a length  at  least 
equal  to  the  value  of  "char_length".  Unused  positions 
in  this  array,  up  to  word(char_length) , will  be  set  to 
zero.  (Output) 

This  array  must  be  of  length  at  least  "char_length" . A 
bit  on  in  a position  of  this  array  indicates  that  the 
corresponding  unit  in  "word"  (including  the  very  last 
unit)  is  the  last  unit  of  a syllable.  (Output) 

3)  char_length  Length  of  the  word  to  be  generated,  in  characters. 

( Input) 

4)  unit_length  This  is  the  length  of  the  generated  random  word  in 

units,  i.e.,  the  index  of  the  last  non-zero  entry  in 
the  "word"  array.  The  actual  length  of  the  word  in 
equivalent  characters  will  be  the  value  of  char_length. 
(Output) 

5)  random_unit  This  is  the  routine  that  will  be  called  by  random_word_ 

each  time  a random  unit  is  needed.  The  random_unit 


1)  word 


2)  hyphens 
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routine  is  declared  as  follows: 

del  randora_unit  entry  (fixed  bin); 

where  the  value  returned  is  a unit  index  between  1 and 
n_units.  If  an  English-like  distribution  of  letters  is 
desired,  the  "random_unit_"  subroutine  may  be  specified 
here.  See  Notes  below.  (Input) 


6)  randora_vowel 

This  is  the  routine  called  by  random_word_  when  a vowel 
unit  is  required.  This  routine  must  return  the  index 
of  a unit  whose  "vowel"  or  "alternate_vowel"  bits  are 
on.  See  Notes  below.  This  routine  is  declared  as 
follows: 

del  random_vowel  entry  (fixed  bin); 

If  desired,  the  subroutine  "random_unit_ $random_vowel" 
may  be  specified  in  this  place.  (Input) 

Notes 

The  word  array  can  be  converted  into  characters  by  calling 
c onve  rt_wo  rd_. 

In  order  to  use  random_word,  a digram  table,  contained  in  a 
segment  named  "digrams",  must  be  available  in  the  search  path.  This 
table  can  be  created  by  the  digram_table_corapiler . 

If  the  user  supplies  his  own  versions  of  random_unit  and 
random_vowel,  these  subroutines  will  have  to  supply  legal  units  that 
are  recognized  by  the  random_word_  subroutine.  The  include  file 
"digram_structure.incl.pl 1"  can  be  used  to  reference  the  digram  table 
to  determine  which  units  are  available.  If  included  in  the  source 
program,  appropriate  references  to  the  following  variables  of  interest 
in  "digrams"  will  be  generated: 

del  n_units  fixed  bin  defined  digrams$n_units; 
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del  letters(0:n_units)  char(2)  aligned 
based(addr(digrams$letters) ) ; 
del  1 rules(n_units)  aligned  based(addr(digrams$rules) ) , 
2 vowel  bit ( 1 ) , 

2 alternate_vowel  bit(1), 


where : 

n_units  is  the  number  of  different  units. 

letters(i)  contains  1 or  2 characters  (left  justified)  for 

the  i'th  unit. 

rules. vowel ( i ) , rules. alt ernate_vowel( i ) 

One  of  these  two  bits  are  set  for  the  units  that 
may  be  returned  by  a call  to  random_vowel. 

When  randomjunit  is  called,  a number  from  1 to  n_units  must  be 
returned.  When  random_vowel  is  called,  a number  from  1 to  n_units, 
where  one  of  the  two  bits  in  rules (i)  is  marked,  must  be  returned. 

Entry:  random_word_$debug_on 

This  entry  sets  a switch  in  random_word__  that  causes  printing  (on 
user_output)  of  partial  words  that  could  not  be  completed.  This  entry 
is  of  interest  during  debugging  of  random_word__  or  for  checking  the 
consistency  of  the  digram  table  prepared  by  the  user. 

Usage 


del  random_word_$debug_on  entry; 
call  random_word_$debug_on ; 

Entry:  random_word_$debug_of f 

This  entry  resets  the  switch  set  by  debug_on. 
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Additional  notes 


The  random_word_  subroutine  can  be  used  for  certain  special 
applications  (such  as  the  application  used  by  hyphenate^ ),  and  there 
are  certain  features  that  help  support  some  of  these  applications. 
The  features  described  below  are  of  little  interest  to  most  users. 

The  first  feature  allows  the  caller-supplied  random^ unit  (and 
random_vowel)  subroutine  to  find  out  whether  random_word__  "accepted” 
or  "rejected"  the  previous  unit  supplied  by  random_unit.  Each  time 
random_unit  is  invoked  by  random_word_,  the  value  of  the  argument 
passed  is  the  index  of  the  previous  unit  that  random_unit_  returned 
(or  zero  on  the  first  call  to  random_unit  in  a given  invocation  of 
random_word_) . The  sign  of  the  argument  will  be  positive  if  this  last 
unit  was  accepted.  "Accepted"  means  that  the  last  unit  was  inserted 
into  the  random  word  and  the  word  index  maintained  by  random_word__  was 
incremented.  Once  a unit  is  accepted,  it  is  never  removed.  Thus  a 
positive  value  of  the  unit  index  passed  to  random_unit  means  that  a 
unit  for  the  next  position  of  the  word  is  requested. 

If  the  unit  index  passed  to  random_unit  has  a negative  sign,  the 
last  unit  was  rejected  according  to  the  rules  used  by  random^ word_  and 
information  supplied  in  the  digram  table.  If  the  unit  is  rejected, 
random_word_  does  not  advance  its  word  index  and  calls  random_unit 
again  for  another  unit  for  that  same  word  position.  With  this 
information  random_unit  can  keep  track  of  the  "progress"  of  the  word 
being  generated. 

The  feature  described  above  is  used  by  the  special  random_unit 
routine  provided  by  hyphenate_.  Since  the  random_unit  routine  for 
hyphenate^  is  not  really  supplying  random  units  (but  is  supplying 
units  of  the  word  to  be  hyphenated) , it  must  know  whether  any 
particular  unit  is  rejected  by  random_word_.  Rejection  then  implies 
that  the  word  is  illegal  according  to  random_word_  rules. 

The  second  feature  allows  random_unit  to  "try"  a certain  unit 
without  committing  that  unit  to  actually  be  used  in  the  random  word. 
The  sign  of  each  unit  supplied  to  randora_word_  by  random^ unit  is 
checked.  If  the  sign  of  the  word  is  positive,  random_word_  will 
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accept  or  reject  the  unit  according  to  its  rules,  and  will  indicate 
this  on  the  subsequent  call  to  randorn_unit . 

If  the  sign  of  the  unit  passed  to  random_word_  is  negative, 
randomjword__  will  merely  indicate  (on  the  subsequent  call  to 
random_unit)  whether  that  unit  would  have  been  accepted,  but  it  never 
actually  updates  the  word  index.  In  other  words,  randorn_word_  always 
rejects  the  unit,  but  lets  random__unit  know  whether  the  unit  was 
acceptable. 

This  latter  feature  is  used  by  hyphenate^ probability  in  order  to 
determine  which  of  all  possible  units  are  acceptable  in  a given 
position  of  the  word.  The  randorn_unit  routine  used  by 
hyphenate^ pr obabili ty  tries  all  possible  units  in  each  word  position, 
and  only  allows  random_word_  to  accept  the  unit  that  actually  appears 
in  that  position. 
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Subroutine 


Name:  read_table_ 

This  subroutine  is  the  compiler  for  the  digram  table  for 
random^ word_.  It  is  called  by  digram_table_compiler . 

Usa.'re 


1) 

2) 

3) 


declare  read_table_  entry  (ptr,  fixed  bin(24),  returns  ( bi t ( 1 ) ) ; 

flag  = read_table_  (source_ptr,  bitcount); 

source_ptr  is  a pointer  to  the  source  segment  to  be  compiled. 
(Input) 

bitcount  is  the  bit  count  of  the  source  segment.  (Input) 

flag  is  "0"b  if  compilation  was  successful.  It  is  " 1"b  if 

an  error  was  encountered. 


Notes 


If  compilation  was  successful,  the  compiled  table  will  be  placed 
in  the  working  directory  with  the  name  "digrams".  If  unsuccessful, 
the  digrams  segment  may  or  may  not  have  been  created,  and  may  be  left 
in  an  inconsistent  state  (i.e.,  unusable  by  random_word_) . Error 
messages  are  printed  out  on  user_output  as  the  errors  are  encountered, 
except  that  file  system  errors  are  printed  on  error_output . 

This  subroutine  uses  the  ALM  assembler  for  part  of  its  work.  As 
a result,  the  letters  "ALM"  will  be  printed  on  user_output  sometime 
during  the  compilation. 
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The  following  pages  contain  documentation  and  source  listings  for 
the  two  modules  that  have  been  altered  to  produce  uniformly  distribut- 
ed random  words  as  discussed  near  the  end  of  Section  IV.  The  two  mod- 
ules are  generate_word_  and  generate_words . In  addition,  a listing  of 
the  random  number  generator  encipher_  is  included  for  those  interested 
in  the  algorithm  used  to  obtain  random  numbers. 

Except  for  generate_words  and  generate_word_,  all  other  modules 
are  unchanged  from  those  shown  in  Appendix  III  and  Appendix  VI.  These 
two  modules  have  been  modified  in  an  upward  compatible  manner.  Thus, 
when  called  as  described  in  Appendix  VI,  they  will  perform  exactly  as 
described.  In  order  to  get  uniformly  distributed  words,  an  additional 
entry  point  in  generate_word_  and  an  additional  control  argument  to 
the  generate_words  command  have  been  provided. 
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Name : generate_word_ 

This  subroutine  returns  a random  pronounceable  word  as  an  ASCII 
character  string.  It  also  returns  the  same  word  split  by  hyphens  into 
syllables  as  an  aid  to  pronunciation. 

Usage 


declare  generate_word_  entry  (char(*),  char(*),  fixed  bin,  fixed 
bin)  ; 

call  generate_word_  (word,  hyphenated_word,  rain,  max); 

1)  word  is  the  random  word,  padded  on  the  right  with 

blanks.  This  string  must  be  long  enough  to  hold 
the  word  (at  least  as  long  as  max).  (Output) 

2)  hyphenated_word  is  the  same  word  split  into  syllables.  The  length 

of  this  string  must  be  greater  than  max  to  allow 
for  the  hyphens.  A length  of  3*raax/2  + 1 will  al- 
ways be  sufficient.  (Output) 

3)  min  is  the  minimum  length  of  the  word  to  be  generated. 

This  value  must  be  greater  than  3 and  less  than 
21.  (Input) 

*0  raax  is  the  maximum  length  of  the  word  to  ^e  generated. 

The  actual  length  of  the  word  will  be  uniformly 
random  between  min  and  max.  The  value  of  max  must 
be  greater  then  or  equal  to  min,  and  less  than  21. 

( Input) 

Note 


Each  call  to  generate_word_  should  produce  a different  random 
word,  regardless  of  when  the  call  is  made.  However,  as  with  any  ran- 
dom generator,  there  is  no  guarantee  that  there  will  be  no  duplicates. 
The  probability  of  duplication  is  greater  with  shorter  words. 
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Entry:  genera te_ word_$init_seed 

This  entry  allows  the  user  to  specify  a starting  seed  for 
generating  random  words.  If  a seed  is  specified,  the  exact  same 
sequence  of  random  words  will  always  be  generated  on  subsequent  calls 
to  generate^ word_  providing  the  same  values  of  min  and  max  are  speci- 
fied. If  this  entry  is  not  called  in  a process,  the  value  of  the 
clock  is  used  as  the  initial  seed  on  the  first  call  to  generate_word_, 
thereby  "guaranteeing"  different  sequences  of  words  in  different  proc- 
esses . 

Usage 


declare  generate_word_$init_seed  entry  (fixed  bin(35)); 

call  generate_word_$init_ seed  (seed); 

1)  seed  is  the  initial  seed  value.  If  zero,  the  system  clock 

will  be  used  as  the  seed.  (Input) 


Note 


If  the  seed  is  a small  integer,  the  first  few  words  generated  may 
not  be  quite  as  random  as  one  might  like,  i.e.,  if  5 is  specified  for 
the  value  of  seed  the  first  word  generated  will  be  almost  the  same  as 
when  some  other  small  integer  is  specified. 

Entry:  generate^ word_$uniform 

This  entry  is  the  same  as  generate_word_,  except  that  the  words 
produced  are  uniformly  distributed.  The  probabilities  of  the  words 
produced  by  generate_word_  are  not  all  the  same.  This  entry  provides 
words  with  equal  probability.  The  method  used  to  generate  uniformly 
distributed  words  results  in  a speed  degradation  that  is  worse  with 
longer  words.  For  eight  letter  words,  factor  of  at  least  10  should 
be  expected.  In  addition,  the  set  of  words  that  may  be  produced  is 
not  quite  as  large  (although  certainly  within  90?  of  the  set  produced 
by  generate_word_) . 

Usage 


declare  generate_word_$ uni form  entry  (char(*),  char(*),  fixed 
bin,  fixed  bin); 
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call  generate_word_  (word,  hyphenated_word,  min,  max); 
Arguments  are  the  same  as  above. 
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Command 


Name : generate^ words , gw 


This  command  will  print  random  pronounceable  "words”  on  the  user's 
terminal . 

Usage 


generate^ words  -control^ args- 
1)  control_args  may  be  selected  from  the  following: 

nwords  is  the  number  of  words  to  print.  If  not  specified, 

one  word  is  printed. 

-min  n specifies  the  minimum  length,  in  characters,  of  the 

words  to  be  generated. 

-max  ri  specifies  the  maximum  length  of  the  words  to  be 

generated. 

-length  n,  -In  n specifies  the  length  of  the  words  to  be  generated. 

If  this  argument  is  specified,  all  words  will  be 
this  length,  and  -min  or  -max  may  not  be  specified. 

-hyphenate,  -hph  causes  the  hyphenated  form  (divided  into  syllables) 
of  each  word  to  be  printed  alongside  the  original 
word . 

-seed  SEED  On  the  first  call  to  generate_words  in  a process, 

the  system  clock  is  used  to  obtain  a starting 

"seed"  for  generating  random  words.  This  seed  is 
updated  for  every  word  generated,  and  subsequent 
values  of  the  seed  depend  on  previous  values  (in  a 
rather  complex  way).  If  the  -seed  argument  is 

specified,  SEED  must  be  a positive  decimal  integer 
between  1 and  9999.  For  a given  value  of  SEED,  the 
sequence  of  random  words  will  always  be  the  same 
providing  the  same  length  values  are  specified. 
When  no  -seed  argument  is  specified,  the  last  value 
of  the  updated  seed  from  the  previous  call  to 

generate^ words  will  be  used.  To  revert  back  to 
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using  the  system  clock  as  the  seed,  specify  a zero 
value  for  SEED,  i.e.,  -seed  0. 

-uniform,  -uf  The  probability  of  the  words  produced  by  this  com- 
mand is  not  the  same  for  all  words,  i.e.,  some 
words  are  more  probable  than  others.  This  option 
changes  the  way  in  which  the  words  are  generated  so 
that  all  words  are  equally  probable.  The  number  of 
different  words  that  can  result  when  this  option  is 
specified  is  a little  smaller  than  the  number  of 
words  that  may  be  produced  without  this  option. 
(One  result  of  using  this  option  is  that  the  letter 
"q"  will  never  appear.)  The  use  of  this  argument 
results  in  a speed  degradation  by  a factor  of  about 
10  for  eight  letter  words,  and  greater  for  longer 
words.  This  argument  is  useful  when  it  is  desira- 
ble to  generate  words  whose  probabilities  are 
equal . 

Notes 

If  neither  -min,  -max,  nor  -length  are  specified,  the  defaults 
are  -min  6 and  -max  8.  In  all  other  cases,  the  defaults  are  -min  4 
and  -max  20. 

If  -length  is  not  specified,  the  lengths  of  the  random  words  will 
be  uniformly  distributed  between  min  and  max.  Words  generated  are 
printed  one  per  line,  with  the  hyphenated  forms,  if  specified,  lined 
up  in  a column  alongside  the  original  words. 
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000037  aa  0 00010  7251  20  107  1x15  ap! array_lengthf • "Get  length  (double  words) 

000040  aa  777777  6250  15  108  eax5  -1,5  "Check  for  zero  or  negative 

000041  Oa  000103  6040  00  109  tmi  return 

000042  aa  000000  6260  00  110  eax6  0 "X6  is  index  into  arrays 


000043  aa  6 00050  3521  00  111  eppbp  variables+CO  "Initial  cipher  text  from  key 

000044  112  cipher_loop: 

000044  aa  2 00000  2371  00  113  ldaq  bp ! 0 
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COMPILATION  LISTING  OF  SEGMENT  «enerate_word_ 
Compiled  by:  Multics  PL/I  Compiler  of  July  2,  1975. 
Compiled  on:  00/19/75  1016.9  edt  Tue 

Options:  check  source 
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5o  del  whole_seed  fixed  bin(71); 

57  del  1 half_seeds  based  (addr  ( whole_seed) ) , 
5b  2 (first,  second)  fixed  bin(35); 
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26  /*  This  is  the  same  as  letters,  but  allows  reference  to  individual  characters  •/ 


35  del  1 rules  (n_units)  aligned  based  (addr  (digramsjrules) ) , 

36  2 no__ f inal_spli t bit  (1),  /•  can't  be  the  only  vowel  in  last  syllable  */ 

37  2 not_begin_sy liable  bit  (1),  /•  can't  begin  a syllable  •/ 

3b  2 vowel  bit  (1),  /•  this  is  a vowel  */ 

39  2 al ternate_vowel  bit  (l);  /•  this  is  an  alternate  vowel,  (i.e,,  "y")  */ 
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then  goto  loop;  /*  keep  trying  until  a onme-letter  unit  is  found  */ 
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12  /•  generate  max_word3  and  write  them  all  out  */ 
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